Genomics in the Cloud (Reprint Edition, English), by Geraldine A. Van der Auwera and Brian D. O'Connor (US), Southeast University Press

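Data in the genomics field is growing explosively. In just a few years, the genomic data hosted by organizations such as the National Institutes of Health (NIH) has passed 50 PB (50 million GB), and these organizations are turning to cloud infrastructure to make that data available to the research community. How do you adapt your analysis tools and protocols to access and analyze that volume of data in the cloud?
With this practical book, researchers will learn how to work with genomics algorithms using open source tools including the Genome Analysis Toolkit (GATK), Docker, WDL, and Terra. Geraldine Van der Auwera, longtime custodian of the GATK user community, and Brian O'Connor of the UC Santa Cruz Genomics Institute guide you through the process. You will learn by working with real data and genomics algorithms from the field.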
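This book covers:
Essential genomics and computing technology background;
Basic cloud computing operations;
Getting started with GATK, plus three major GATK Best Practices;
Automating analysis with scripted workflows written in WDL and run with Cromwell;
Scaling up workflow execution in the cloud, including parallelization and cost optimization;
Interactive analysis in the cloud using Jupyter notebooks;
Using Terra to ensure collaboration and computational reproducibility.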

Foreword
Preface
1. Introduction
The Promises and Challenges of Big Data in Biology and Life Sciences
Infrastructure Challenges
Toward a Cloud-Based Ecosystem for Data Sharing and Analysis
Cloud-Hosted Data and Compute
Platforms for Research in the Life Sciences
Standardization and Reuse of Infrastructure
Being FAIR
Wrap-Up and Next Steps
2. Genomics in a Nutshell: A Primer for Newcomers to the Field
Introduction to Genomics
The Gene as a Discrete Unit of Inheritance (Sort Of)
The Central Dogma of Biology: DNA to RNA to Protein
The Origins and Consequences of DNA Mutations
Genomics as an Inventory of Variation in and Among Genomes
The Challenge of Genomic Scale, by the Numbers
Genomic Variation
The Reference Genome as Common Framework
Physical Classification of Variants
Germline Variants Versus Somatic Alterations
High-Throughput Sequencing Data Generation
From Biological Sample to Huge Pile of Read Data
Types of DNA Libraries: Choosing the Right Experimental Design
Data Processing and Analysis
Mapping Reads to the Reference Genome
Variant Calling
Data Quality and Sources of Error
Functional Equivalence Pipeline Specification
Wrap-Up and Next Steps
3. Computing Technology Basics for Life Scientists
Basic Infrastructure Components and Performance Bottlenecks
Types of Processor Hardware: CPU, GPU, TPU, FPGA, OMG
Levels of Compute Organization: Core, Node, Cluster, and Cloud
Addressing Performance Bottlenecks
Parallel Computing
Parallelizing a Simple Analysis
From Cores to Clusters and Clouds: Many Levels of Parallelism
Trade-Offs of Parallelism: Speed, Efficiency, and Cost
Pipelining for Parallelization and Automation
Workflow Languages
Popular Pipelining Languages for Genomics
Workflow Management Systems
Virtualization and the Cloud
VMs and Containers
Introducing the Cloud
Categories of Research Use Cases for Cloud Services
Wrap-Up and Next Steps
4. First Steps in the Cloud
Setting Up Your Google Cloud Account and First Project
Creating a Project
Checking Your Billing Account and Activating Free Credits
Running Basic Commands in Google Cloud Shell
Logging in to the Cloud Shell VM
Using gsutil to Access and Manage Files
Pulling a Docker Image and Spinning Up the Container
Mounting a Volume to Access the Filesystem from Within the Container
Setting Up Your Own Custom VM
Creating and Configuring Your VM Instance
Logging into Your VM by Using SSH
Checking Your Authentication
Copying the Book Materials to Your VM
Installing Docker on Your VM
Setting Up the GATK Container Image
……
6. GATK Best Practices for Germline Short Variant Discovery
7. GATK Best Practices for Somatic Variant Discovery
8. Automating Analysis Execution with Workflows
9. Deciphering Real Genomics Workflows
10. Running Single Workflows at Scale with Pipelines API
11. Running Many Workflows Conveniently in Terra
12. Interactive Analysis in Jupyter Notebook
13. Assembling Your Own Workspace in Terra
14. Making a Fully Reproducible Paper
Glossary
Index

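Title	Genomics in the Cloud (Reprint Edition, English)
Category	Science and Technology - Natural Sciences - Biological Sciences
Authors	Geraldine A. Van der Auwera // Brian D. O'Connor (US)
Publisher	Southeast University Press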
About the author	Dr. Geraldine A. Van der Auwera is the outreach and communications lead for the Data Sciences Platform at the Broad Institute of MIT and Harvard.