Efficient gene-environment interaction tests for large biobank-scale sequencing studies.

Publication Type: Journal Article
Year of Publication: 2020
Authors: Wang, X; Lim, E; Liu, C-T; Sung, YJu; Rao, DC; Morrison, AC; Boerwinkle, E; Manning, AK; Chen, H
Journal: Genet Epidemiol
Date Published: 2020 Nov
Keywords: Biological Specimen Banks; Body Mass Index; Computer Simulation; Exome; Exome Sequencing; Female; Gene-Environment Interaction; Humans; Linear Models; Male; Models, Genetic; Obesity; Phenotype; Quantitative Trait, Heritable; Time Factors

Complex human diseases are affected by genetic and environmental risk factors and their interactions. Gene-environment interaction (GEI) tests for aggregate genetic variant sets have been developed in recent years. However, existing statistical methods become rate-limiting for large biobank-scale sequencing studies with correlated samples. We propose efficient Mixed-model Association tests for GEne-Environment interactions (MAGEE), for testing GEI between an aggregate variant set and environmental exposures on quantitative and binary traits in large-scale sequencing studies with related individuals. Joint tests for the aggregate genetic main effects and GEI effects are also developed. A null generalized linear mixed model adjusting for covariates but without any genetic effects is fit only once in a whole-genome GEI analysis, thereby vastly reducing the overall computational burden. Score tests for variant sets are performed as a combination of genetic burden and variance component tests by accounting for the genetic main effects using matrix projections. The computational complexity is dramatically reduced in a whole-genome GEI analysis, which makes MAGEE scalable to hundreds of thousands of individuals. We applied MAGEE to the exome sequencing data of 41,144 related individuals from the UK Biobank, and the analysis of 18,970 protein-coding genes finished within 10.4 CPU hours.
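
The "fit the null model once, then run score tests with matrix projections" idea can be illustrated with a minimal sketch. It is not the MAGEE implementation or its API: it assumes unrelated samples and an ordinary least-squares null model in place of the generalized linear mixed model, and the names `fit_null`, `apply_P`, and `gei_score` are hypothetical. It shows only how the genetic main effects are projected out of the interaction score and how a variance-component statistic is formed.

```python
import numpy as np

def fit_null(y, X):
    """Fit the covariate-only null model once (plain OLS in this sketch)."""
    XtX_inv = np.linalg.inv(X.T @ X)
    resid = y - X @ (XtX_inv @ (X.T @ y))
    sigma2 = resid @ resid / (len(y) - X.shape[1])
    return {"X": X, "XtX_inv": XtX_inv, "resid": resid, "sigma2": sigma2}

def apply_P(null, M):
    """Return P @ M for P = (I - X (X'X)^-1 X') / sigma2 without forming P."""
    X = null["X"]
    return (M - X @ (null["XtX_inv"] @ (X.T @ M))) / null["sigma2"]

def gei_score(null, G, E):
    """Interaction score for one variant set, adjusted for main effects.

    G is an n x q genotype matrix and E an exposure vector of length n.
    """
    K = G * E[:, None]                    # interaction columns G_j * E
    r = null["resid"] / null["sigma2"]    # equals P @ y
    PG, PK = apply_P(null, G), apply_P(null, K)
    A = (K.T @ PG) @ np.linalg.inv(G.T @ PG)
    U = K.T @ r - A @ (G.T @ r)           # score with main effects projected out
    V = K.T @ PK - A @ (G.T @ PK)         # its null covariance
    Q = float(U @ U)                      # variance-component statistic
    lam = np.linalg.eigvalsh((V + V.T) / 2)  # weights of the chi-square mixture
    return U, V, Q, lam
```

In a genome-wide scan under these assumptions, `fit_null` would be called once with the exposure included among the covariates in `X`, and `gei_score` would be called per variant set. Under the null, `Q` is compared against a mixture of one-degree-of-freedom chi-square variables weighted by `lam`; the p-value step is omitted here.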

Alternate Journal: Genet Epidemiol
PubMed ID: 32864785
PubMed Central ID: PMC7754763
Grant List: R00 HL130593 / HL / NHLBI NIH HHS / United States; R01 HL145025 / HL / NHLBI NIH HHS / United States
