Title | Mining genetic epidemiology data with Bayesian networks application to APOE gene variation and plasma lipid levels. |
Publication Type | Journal Article |
Year of Publication | 2005 |
Authors | Rodin, A, Mosley, TH, Clark, AG, Sing, CF, Boerwinkle, E |
Journal | J Comput Biol |
Volume | 12 |
Issue | 1 |
Pagination | 1-11 |
Date Published | 2005 |
ISSN | 1066-5277 |
Keywords | Apolipoproteins E, Bayes Theorem, Databases, Factual, Genetics, Population, Humans, Linkage Disequilibrium, Lipids, Models, Genetic, Polymorphism, Single Nucleotide |
Abstract | There is a critical need for data-mining methods that can identify SNPs that predict among individual variation in a phenotype of interest and reverse-engineer the biological network of relationships between SNPs, phenotypes, and other factors. This problem is both challenging and important in light of the large number of SNPs in many genes of interest and across the human genome. A potentially fruitful form of exploratory data analysis is the Bayesian or Belief network. A Bayesian or Belief network provides an analytic approach for identifying robust predictors of among-individual variation in a disease endpoints or risk factor levels. We have applied Belief networks to SNP variation in the human APOE gene and plasma apolipoprotein E levels from two samples: 702 African-Americans from Jackson, MS, and 854 non-Hispanic whites from Rochester, MN. Twenty variable sites in the APOE gene were genotyped in both samples. In Jackson, MS, SNPs 4036 and 4075 were identified to influence plasma apoE levels. In Rochester, MN, SNPs 3937 and 4075 were identified to influence plasma apoE levels. All three SNPs had been previously implicated in affecting measures of lipid and lipoprotein metabolism. Like all data-mining methods, Belief networks are meant to complement traditional hypothesis-driven methods of data analysis. These results document the utility of a Belief network approach for mining large scale genotype-phenotype association data. |
DOI | 10.1089/cmb.2005.12.1 |
Alternate Journal | J Comput Biol |
PubMed ID | 15725730 |
PubMed Central ID | PMC1201451 |
Grant List | P50 GM065509 / GM / NIGMS NIH HHS / United States R01 HL072905 / HL / NHLBI NIH HHS / United States |
Mining genetic epidemiology data with Bayesian networks application to APOE gene variation and plasma lipid levels.
Similar Publications
Inverted triplications formed by iterative template switches generate structural variant diversity at genomic disorder loci. Cell Genom. 2024;4(7):100590. | .
Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations. Mol Genet Genomics. 2024;299(1):65. | .
Genetic diversity of 1,845 rhesus macaques improves genetic variation interpretation and identifies disease models. Nat Commun. 2024;15(1):5658. | .