Title | Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines. |
Publication Type | Journal Article |
Year of Publication | 2018 |
Authors | Ellrott, K, Bailey, MH, Saksena, G, Covington, KR, Kandoth, C, Stewart, C, Hess, J, Ma, S, Chiotti, KE, McLellan, M, Sofia, HJ, Hutter, C, Getz, G, Wheeler, D, Ding, L |
Corporate Authors | MC3 Working Group, Cancer Genome Atlas Research Network |
Journal | Cell Syst |
Volume | 6 |
Issue | 3 |
Pagination | 271-281.e7 |
Date Published | 2018 Mar 28 |
ISSN | 2405-4712 |
Keywords | Algorithms, Exome, Exome Sequencing, Genomics, High-Throughput Nucleotide Sequencing, Humans, Information Dissemination, Mutation, Neoplasms, Sequence Analysis, DNA, Software |
Abstract | The Cancer Genome Atlas (TCGA) cancer genomics dataset includes over 10,000 tumor-normal exome pairs across 33 different cancer types, in total >400 TB of raw data files requiring analysis. Here we describe the Multi-Center Mutation Calling in Multiple Cancers project, our effort to generate a comprehensive encyclopedia of somatic mutation calls for the TCGA data to enable robust cross-tumor-type analyses. Our approach accounts for variance and batch effects introduced by the rapid advancement of DNA extraction, hybridization-capture, sequencing, and analysis methods over time. We present best practices for applying an ensemble of seven mutation-calling algorithms with scoring and artifact filtering. The dataset created by this analysis includes 3.5 million somatic variants and forms the basis for PanCan Atlas papers. The results have been made available to the research community along with the methods used to generate them. This project is the result of collaboration from a number of institutes and demonstrates how team science drives extremely large genomics projects. |
DOI | 10.1016/j.cels.2018.03.002 |
Alternate Journal | Cell Syst |
PubMed ID | 29596782 |
PubMed Central ID | PMC6075717 |
Grant List | P30 CA016672 / CA / NCI NIH HHS / United States U24 CA143882 / CA / NCI NIH HHS / United States U24 CA143866 / CA / NCI NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States U24 CA143840 / CA / NCI NIH HHS / United States U24 CA143843 / CA / NCI NIH HHS / United States U24 CA143858 / CA / NCI NIH HHS / United States U24 CA143848 / CA / NCI NIH HHS / United States R01 CA183793 / CA / NCI NIH HHS / United States R01 CA180778 / CA / NCI NIH HHS / United States U24 CA210949 / CA / NCI NIH HHS / United States R01 CA163722 / CA / NCI NIH HHS / United States U24 CA143867 / CA / NCI NIH HHS / United States U24 CA210990 / CA / NCI NIH HHS / United States U54 HG003067 / HG / NHGRI NIH HHS / United States U24 CA143835 / CA / NCI NIH HHS / United States U01 HG006517 / HG / NHGRI NIH HHS / United States U24 CA210950 / CA / NCI NIH HHS / United States U24 CA143845 / CA / NCI NIH HHS / United States U24 CA143799 / CA / NCI NIH HHS / United States P30 CA008748 / CA / NCI NIH HHS / United States U24 CA144025 / CA / NCI NIH HHS / United States U24 CA210957 / CA / NCI NIH HHS / United States U54 HG003079 / HG / NHGRI NIH HHS / United States U24 CA210969 / CA / NCI NIH HHS / United States U54 HG007990 / HG / NHGRI NIH HHS / United States U24 CA143883 / CA / NCI NIH HHS / United States U24 CA211006 / CA / NCI NIH HHS / United States |
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines.
Similar Publications
Single cell dual-omic atlas of the human developing retina. Nat Commun. 2024;15(1):6792. | .
Improved high quality sand fly assemblies enabled by ultra low input long read sequencing. Sci Data. 2024;11(1):918. | .
Loss of symmetric cell division of apical neural progenitors drives DENND5A-related developmental and epileptic encephalopathy. Nat Commun. 2024;15(1):7239. | .