Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

TitleSequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.
Publication TypeJournal Article
Year of Publication2021
AuthorsTaliun, D, Harris, DN, Kessler, MD, Carlson, J, Szpiech, ZA, Torres, R, Taliun, SAGagliano, Corvelo, A, Gogarten, SM, Kang, HMin, Pitsillides, AN, LeFaive, J, Lee, S-B, Tian, X, Browning, BL, Das, S, Emde, A-K, Clarke, WE, Loesch, DP, Shetty, AC, Blackwell, TW, Smith, AV, Wong, Q, Liu, X, Conomos, MP, Bobo, DM, Aguet, F, Albert, C, Alonso, A, Ardlie, KG, Arking, DE, Aslibekyan, S, Auer, PL, Barnard, J, R Barr, G, Barwick, L, Becker, LC, Beer, RL, Benjamin, EJ, Bielak, LF, Blangero, J, Boehnke, M, Bowden, DW, Brody, JA, Burchard, EG, Cade, BE, Casella, JF, Chalazan, B, Chasman, DI, Chen, Y-DIda, Cho, MH, Choi, SHoan, Chung, MK, Clish, CB, Correa, A, Curran, JE, Custer, B, Darbar, D, Daya, M, de Andrade, M, DeMeo, DL, Dutcher, SK, Ellinor, PT, Emery, LS, Eng, C, Fatkin, D, Fingerlin, T, Forer, L, Fornage, M, Franceschini, N, Fuchsberger, C, Fullerton, SM, Germer, S, Gladwin, MT, Gottlieb, DJ, Guo, X, Hall, ME, He, J, Heard-Costa, NL, Heckbert, SR, Irvin, MR, Johnsen, JM, Johnson, AD, Kaplan, R, Kardia, SLR, Kelly, T, Kelly, S, Kenny, EE, Kiel, DP, Klemmer, R, Konkle, BA, Kooperberg, C, Köttgen, A, Lange, LA, Lasky-Su, J, Levy, D, Lin, X, Lin, K-H, Liu, C, Loos, RJF, Garman, L, Gerszten, R, Lubitz, SA, Lunetta, KL, C Y Mak, A, Manichaikul, A, Manning, AK, Mathias, RA, McManus, DD, McGarvey, ST, Meigs, JB, Meyers, DA, Mikulla, JL, Minear, MA, Mitchell, BD, Mohanty, S, Montasser, ME, Montgomery, C, Morrison, AC, Murabito, JM, Natale, A, Natarajan, P, Nelson, SC, North, KE, O'Connell, JR, Palmer, ND, Pankratz, N, Peloso, GM, Peyser, PA, Pleiness, J, Post, WS, Psaty, BM, Rao, DC, Redline, S, Reiner, AP, Roden, D, Rotter, JI, Ruczinski, I, Sarnowski, C, Schoenherr, S, Schwartz, DA, Seo, J-S, Seshadri, S, Sheehan, VA, Sheu, WH, M Shoemaker, B, Smith, NL, Smith, JA, Sotoodehnia, N, Stilp, AM, Tang, W, Taylor, KD, Telen, M, Thornton, TA, Tracy, RP, Van Den Berg, DJ, Vasan, RS, Viaud-Martinez, KA, Vrieze, S, Weeks, DE, Weir, BS, Weiss, ST, Weng, L-C, Willer, CJ, Zhang, Y, Zhao, X, Arnett, DK, Ashley-Koch, AE, Barnes, KC, Boerwinkle, E, Gabriel, S, Gibbs, RA, Rice, KM, Rich, SS, Silverman, EK, Qasba, P, Gan, W, Papanicolaou, GJ, Nickerson, DA, Browning, SR, Zody, MC, Zöllner, S, Wilson, JG, L Cupples, A, Laurie, CC, Jaquish, CE, Hernandez, RD, O'Connor, TD, Abecasis, GR
Corporate AuthorsNHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
JournalNature
Volume590
Issue7845
Pagination290-299
Date Published2021 02
ISSN1476-4687
KeywordsCytochrome P-450 CYP2D6, Genetic Variation, Genome, Human, Genomics, Haplotypes, Heterozygote, Humans, INDEL Mutation, Loss of Function Mutation, Mutagenesis, National Heart, Lung, and Blood Institute (U.S.), Phenotype, Polymorphism, Single Nucleotide, Population Density, Precision Medicine, Quality Control, Sample Size, United States, Whole Genome Sequencing
Abstract

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes). In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

DOI10.1038/s41586-021-03205-y
Alternate JournalNature
PubMed ID33568819
PubMed Central IDPMC7875770
Grant ListR35 HG010692 / HG / NHGRI NIH HHS / United States
R01 HL120393 / HL / NHLBI NIH HHS / United States
P50 HL118006 / HL / NHLBI NIH HHS / United States
T32 HG000040 / HG / NHGRI NIH HHS / United States
R01 HL142711 / HL / NHLBI NIH HHS / United States
K08 HL141601 / HL / NHLBI NIH HHS / United States
HHSN268201800001C / HL / NHLBI NIH HHS / United States
K24 HL148521 / HL / NHLBI NIH HHS / United States
R21 HL123677 / HL / NHLBI NIH HHS / United States
R01 MD012765 / MD / NIMHD NIH HHS / United States
R01 HL117626 / HL / NHLBI NIH HHS / United States
R01 HL131565 / HL / NHLBI NIH HHS / United States
R01 HL149836 / HL / NHLBI NIH HHS / United States
U01 HL120393 / HL / NHLBI NIH HHS / United States
R01 DK117445 / DK / NIDDK NIH HHS / United States