EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants.

TitleEPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants.
Publication TypeJournal Article
Year of Publication2022
AuthorsParvandeh, S, Donehower, LA, Panagiotis, K, Hsu, T-K, Asmussen, JK, Lee, K, Lichtarge, O
JournalNucleic Acids Res
Volume50
Issue12
Paginatione70
Date Published2022 Jul 08
ISSN1362-4962
KeywordsHumans, Machine Learning, Mutation, Neoplasms, Oncogenes, Phylogeny
Abstract

Discovering rare cancer driver genes is difficult because their mutational frequency is too low for statistical detection by computational methods. EPIMUTESTR is an integrative nearest-neighbor machine learning algorithm that identifies such marginal genes by modeling the fitness of their mutations with the phylogenetic Evolutionary Action (EA) score. Over cohorts of sequenced patients from The Cancer Genome Atlas representing 33 tumor types, EPIMUTESTR detected 214 previously inferred cancer driver genes and 137 new candidates never identified computationally before of which seven genes are supported in the COSMIC Cancer Gene Census. EPIMUTESTR achieved better robustness and specificity than existing methods in a number of benchmark methods and datasets.

DOI10.1093/nar/gkac215
Alternate JournalNucleic Acids Res
PubMed ID35412634
PubMed Central IDPMC9262594
Grant ListR01 AG061105 / AG / NIA NIH HHS / United States
R01 AG074009 / AG / NIA NIH HHS / United States
T15 LM007093 / LM / NLM NIH HHS / United States
U01 AG068214 / AG / NIA NIH HHS / United States

Similar Publications