MethPhaser: methylation-based long-read haplotype phasing of human genomes.

TitleMethPhaser: methylation-based long-read haplotype phasing of human genomes.
Publication TypeJournal Article
Year of Publication2024
AuthorsFu, Y, Aganezov, S, Mahmoud, M, Beaulaurier, J, Juul, S, Treangen, TJ, Sedlazeck, FJ
JournalNat Commun
Volume15
Issue1
Pagination5327
Date Published2024 Jun 22
ISSN2041-1723
KeywordsCell Line, DNA Methylation, Genome, Human, Haplotypes, Humans, Mutation, Polymorphism, Single Nucleotide
Abstract

The assignment of variants across haplotypes, phasing, is crucial for predicting the consequences, interaction, and inheritance of mutations and is a key step in improving our understanding of phenotype and disease. However, phasing is limited by read length and stretches of homozygosity along the genome. To overcome this limitation, we designed MethPhaser, a method that utilizes methylation signals from Oxford Nanopore Technologies to extend Single Nucleotide Variation (SNV)-based phasing. We demonstrate that haplotype-specific methylations extensively exist in Human genomes and the advent of long-read technologies enabled direct report of methylation signals. For ONT R9 and R10 cell line data, we increase the phase length N50 by 78%-151% at a phasing accuracy of 83.4-98.7% To assess the impact of tissue purity and random methylation signals due to inactivation, we also applied MethPhaser on blood samples from 4 patients, still showing improvements over SNV-only phasing. MethPhaser further improves phasing across HLA and multiple other medically relevant genes, improving our understanding of how mutations interact across multiple phenotypes. The concept of MethPhaser can also be extended to non-human diploid genomes. MethPhaser is available at https://github.com/treangenlab/methphaser .

DOI10.1038/s41467-024-49588-0
Alternate JournalNat Commun
PubMed ID38909018
PubMed Central IDPMC11193733
Grant ListP01 AI152999 / AI / NIAID NIH HHS / United States
U01 HG011758 / HG / NHGRI NIH HHS / United States
UM1 HG008898 / HG / NHGRI NIH HHS / United States

Similar Publications