Characterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms.

TitleCharacterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms.
Publication TypeJournal Article
Year of Publication2012
AuthorsLu, JT, Wang, Y, Gibbs, RA, Yu, F
JournalGenome Biol
Volume13
Issue2
PaginationR15
Date Published2012 Feb 29
ISSN1474-760X
KeywordsGene Frequency, Genetics, Population, Genome, Human, Genome-Wide Association Study, HapMap Project, Heterozygote, Humans, INDEL Mutation, Linkage Disequilibrium, Polymorphism, Genetic, Polymorphism, Single Nucleotide
Abstract

BACKGROUND: Indels are an important cause of human variation and central to the study of human disease. The 1000 Genomes Project Low-Coverage Pilot identified over 1.3 million indels shorter than 50 bp, of which over 890 were identified as potentially disruptive variants. Yet, despite their ubiquity, the local genomic characteristics of indels remain unexplored.RESULTS: Herein we describe population- and minor allele frequency-based differences in linkage disequilibrium and imputation characteristics for indels included in the 1000 Genomes Project Low-Coverage Pilot for the CEU, YRI and CHB+JPT populations. Common indels were well tagged by nearby SNPs in all studied populations, and were also tagged at a similar rate to common SNPs. Both neutral and functionally deleterious common indels were imputed with greater than 95% concordance from HapMap Phase 3 and OMNI SNP sites. Further, 38 to 56% of low frequency indels were tagged by low frequency SNPs. We were able to impute heterozygous low frequency indels with over 50% concordance. Lastly, our analysis also revealed evidence of ascertainment bias. This bias prevents us from extending the applicability of our results to highly polymorphic indels that could not be identified in the Low-Coverage Pilot.CONCLUSIONS: Although further scope exists to improve the imputation of low frequency indels, our study demonstrates that there are already ample opportunities to retrospectively impute indels for prior genome-wide association studies and to incorporate indel imputation into future case/control studies.

DOI10.1186/gb-2012-13-2-r15
Alternate JournalGenome Biol
PubMed ID22377349
PubMed Central IDPMC3334570
Grant ListU54 HG003273 / HG / NHGRI NIH HHS / United States
1U01HG005211-0109 / HG / NHGRI NIH HHS / United States
5U54HG003273 / HG / NHGRI NIH HHS / United States

Similar Publications