Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects.

TitleFunctional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects.
Publication TypeJournal Article
Year of Publication2018
AuthorsRegier, AA, Farjoun, Y, Larson, DE, Krasheninina, O, Kang, HMin, Howrigan, DP, Chen, B-J, Kher, M, Banks, E, Ames, DC, English, AC, Li, H, Xing, J, Zhang, Y, Matise, T, Abecasis, GR, Salerno, W, Zody, MC, Neale, BM, Hall, IM
JournalNat Commun
Volume9
Issue1
Pagination4038
Date Published2018 Oct 02
ISSN2041-1723
KeywordsGenome, Human, Human Genetics, Humans, Whole Genome Sequencing
Abstract

Hundreds of thousands of human whole genome sequencing (WGS) datasets will be generated over the next few years. These data are more valuable in aggregate: joint analysis of genomes from many sources increases sample size and statistical power. A central challenge for joint analysis is that different WGS data processing pipelines cause substantial differences in variant calling in combined datasets, necessitating computationally expensive reprocessing. This approach is no longer tenable given the scale of current studies and data volumes. Here, we define WGS data processing standards that allow different groups to produce functionally equivalent (FE) results, yet still innovate on data processing pipelines. We present initial FE pipelines developed at five genome centers and show that they yield similar variant calling results and produce significantly less variability than sequencing replicates. This work alleviates a key technical bottleneck for genome aggregation and helps lay the foundation for community-wide human genetics studies.

DOI10.1038/s41467-018-06159-4
Alternate JournalNat Commun
PubMed ID30279509
PubMed Central IDPMC6168605
Grant ListUM1 HG008853 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
UM1 HG008901 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
UM1 HG008895 / HG / NHGRI NIH HHS / United States
UM1 HG008895 / / U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI) / International
R01 HG002818 / HG / NHGRI NIH HHS / United States
U24 HG008956 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
UM1 HG008901 / HG / NHGRI NIH HHS / United States
U01 HG00908 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
UM1 HG008898 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
UM1 HG008853 / HG / NHGRI NIH HHS / United States
3R01HL-117626-02S1 / / U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI) / International
R01 MH107649 / MH / NIMH NIH HHS / United States
U01 HL137182 / HL / NHLBI NIH HHS / United States
UM1 HG008895 / / U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI) / International
R21 HL133758 / HL / NHLBI NIH HHS / United States
U01 HG009088 / HG / NHGRI NIH HHS / United States
UM1 HG008898 / HG / NHGRI NIH HHS / United States
R01 HL117626 / HL / NHLBI NIH HHS / United States
U24 HG008956 / HG / NHGRI NIH HHS / United States
R01 MH107649 / / U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH) / International
4 R01 HL117626-04 / / U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI) / International

Similar Publications

Chen F, Zhang Y, Chandrashekar DS, Varambally S, Creighton CJ. Global impact of somatic structural variation on the cancer proteome. Nat Commun. 2023;14(1):5637.
Rhie A, Nurk S, Cechova M, Hoyt SJ, Taylor DJ, Altemose N, et al.. The complete sequence of a human Y chromosome. Nature. 2023;621(7978):344-354.
Saengboonmee C, Sorin S, Sangkhamanon S, Chomphoo S, Indramanee S, Seubwai W, et al.. γ-aminobutyric acid B2 receptor: A potential therapeutic target for cholangiocarcinoma in patients with diabetes mellitus. World J Gastroenterol. 2023;29(28):4416-4432.
Wojcik MH, Reuter CM, Marwaha S, Mahmoud M, Duyzend MH, Barseghyan H, et al.. Beyond the exome: What's next in diagnostic testing for Mendelian conditions. Am J Hum Genet. 2023;110(8):1229-1248.
Chin C-S, Behera S, Khalak A, Sedlazeck FJ, Sudmant PH, Wagner J, et al.. Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes. Nat Methods. 2023;20(8):1213-1221.
Zhao N, Teles F, Lu J, Koestler DC, Beck J, Boerwinkle E, et al.. Epigenome-wide association study using peripheral blood leukocytes identifies genomic regions associated with periodontal disease and edentulism in the Atherosclerosis Risk in Communities study. J Clin Periodontol. 2023;50(9):1140-1153.
Harris RA, McAllister JM, Strauss JF. Single-Cell RNA-Seq Identifies Pathways and Genes Contributing to the Hyperandrogenemia Associated with Polycystic Ovary Syndrome. Int J Mol Sci. 2023;24(13).
Qian X, Srinivasan T, He J, Chen R. The Role of Ceramide in Inherited Retinal Disease Pathology. Adv Exp Med Biol. 2023;1415:303-307.
Calame DG, Guo T, Wang C, Garrett L, Jolly A, Dawood M, et al.. Monoallelic variation in DHX9, the gene encoding the DExH-box helicase DHX9, underlies neurodevelopment disorders and Charcot-Marie-Tooth disease. Am J Hum Genet. 2023;110(8):1394-1413.
Walker KA, Chen J, Shi L, Yang Y, Fornage M, Zhou L, et al.. Proteomics analysis of plasma from middle-aged adults identifies protein markers of dementia risk in later life. Sci Transl Med. 2023;15(705):eadf5681.