Title | The complete sequence of a human Y chromosome. |
Publication Type | Journal Article |
Year of Publication | 2023 |
Authors | Rhie, A, Nurk, S, Cechova, M, Hoyt, SJ, Taylor, DJ, Altemose, N, Hook, PW, Koren, S, Rautiainen, M, Alexandrov, IA, Allen, J, Asri, M, Bzikadze, AV, Chen, N-C, Chin, C-S, Diekhans, M, Flicek, P, Formenti, G, Fungtammasan, A, García Girón, C, Garrison, E, Gershman, A, Gerton, JL, Grady, PGS, Guarracino, A, Haggerty, L, Halabian, R, Hansen, NF, Harris, R, Hartley, GA, Harvey, WT, Haukness, M, Heinz, J, Hourlier, T, Hubley, RM, Hunt, SE, Hwang, S, Jain, M, Kesharwani, RK, Lewis, AP, Li, H, Logsdon, GA, Lucas, JK, Makalowski, W, Markovic, C, Martin, FJ, Mc Cartney, AM, McCoy, RC, McDaniel, J, McNulty, BM, Medvedev, P, Mikheenko, A, Munson, KM, Murphy, TD, Olsen, HE, Olson, ND, Paulin, LF, Porubsky, D, Potapova, T, Ryabov, F, Salzberg, SL, Sauria, MEG, Sedlazeck, FJ, Shafin, K, Shepelev, VA, Shumate, A, Storer, JM, Surapaneni, L, Taravella Oill, AM, Thibaud-Nissen, F, Timp, W, Tomaszkiewicz, M, Vollger, MR, Walenz, BP, Watwood, AC, Weissensteiner, MH, Wenger, AM, Wilson, MA, Zarate, S, Zhu, Y, Zook, JM, Eichler, EE, O'Neill, RJ, Schatz, MC, Miga, KH, Makova, KD, Phillippy, AM |
Journal | Nature |
Volume | 621 |
Issue | 7978 |
Pagination | 344-354 |
Date Published | 2023 Sep |
ISSN | 1476-4687 |
Keywords | Chromosomes, Human, Y; Genomics; Humans; Segmental Duplications, Genomic; Tandem Repeat Sequences; Telomere |
Abstract | The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes. |
DOI | 10.1038/s41586-023-06457-y |
Alternate Journal | Nature |
PubMed ID | 37612512 |
PubMed Central ID | 3975068 |
Grant List | R01 HG011274 / HG / NHGRI NIH HHS / United States; R21 HG010548 / HG / NHGRI NIH HHS / United States; U01 HG010971 / HG / NHGRI NIH HHS / United States |