Title | An integrative variant analysis suite for whole exome next-generation sequencing data. |
Publication Type | Journal Article |
Year of Publication | 2012 |
Authors | Challis, D, Yu, J, Evani, US, Jackson, AR, Paithankar, S, Coarfa, C, Milosavljevic, A, Gibbs, RA, Yu, F |
Journal | BMC Bioinformatics |
Volume | 13 |
Pagination | 8 |
Date Published | 2012 Jan 12 |
ISSN | 1471-2105 |
Keywords | Exome, Genome, Human, Genomics, High-Throughput Nucleotide Sequencing, Humans, INDEL Mutation, Open Reading Frames, Polymorphism, Single Nucleotide, Software |
Abstract | BACKGROUND: Whole exome capture sequencing allows researchers to cost-effectively sequence the coding regions of the genome. Although the exome capture sequencing methods have become routine and well established, there is currently a lack of tools specialized for variant calling in this type of data.RESULTS: Using statistical models trained on validated whole-exome capture sequencing data, the Atlas2 Suite is an integrative variant analysis pipeline optimized for variant discovery on all three of the widely used next generation sequencing platforms (SOLiD, Illumina, and Roche 454). The suite employs logistic regression models in conjunction with user-adjustable cutoffs to accurately separate true SNPs and INDELs from sequencing and mapping errors with high sensitivity (96.7%).CONCLUSION: We have implemented the Atlas2 Suite and applied it to 92 whole exome samples from the 1000 Genomes Project. The Atlas2 Suite is available for download at http://sourceforge.net/projects/atlas2/. In addition to a command line version, the suite has been integrated into the Genboree Workbench, allowing biomedical scientists with minimal informatics expertise to remotely call, view, and further analyze variants through a simple web interface. The existing genomic databases displayed via the Genboree browser also streamline the process from variant discovery to functional genomics analysis, resulting in an off-the-shelf toolkit for the broader community. |
DOI | 10.1186/1471-2105-13-8 |
Alternate Journal | BMC Bioinformatics |
PubMed ID | 22239737 |
PubMed Central ID | PMC3292476 |
Grant List | 1U01HG005211-0109 / HG / NHGRI NIH HHS / United States 5U54HG003273 / HG / NHGRI NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States R01HG004009 / HG / NHGRI NIH HHS / United States U01DA025956 / DA / NIDA NIH HHS / United States |
An integrative variant analysis suite for whole exome next-generation sequencing data.
Similar Publications
Single cell dual-omic atlas of the human developing retina. Nat Commun. 2024;15(1):6792. | .
Improved high quality sand fly assemblies enabled by ultra low input long read sequencing. Sci Data. 2024;11(1):918. | .
Loss of symmetric cell division of apical neural progenitors drives DENND5A-related developmental and epileptic encephalopathy. Nat Commun. 2024;15(1):7239. | .