Title | Pash 2.0: scaleable sequence anchoring for next-generation sequencing technologies. |
Publication Type | Journal Article |
Year of Publication | 2008 |
Authors | Coarfa, C, Milosavljevic, A |
Journal | Pac Symp Biocomput |
Pagination | 102-13 |
Date Published | 2008 |
ISSN | 2335-6928 |
Keywords | Algorithms, Animals, Computational Biology, Databases, Genetic, Evolution, Molecular, Genome, Human, Humans, Sensitivity and Specificity, Sequence Alignment, Software |
Abstract | Many applications of next-generation sequencing technologies involve anchoring of a sequence fragment or a tag onto a corresponding position on a reference genome assembly. Positional Hashing method, implemented in the Pash 2.0 program, is specifically designed for the task of high-volume anchoring. In this article we present multi-diagonal gapped kmer collation and other improvements introduced in Pash 2.0 that further improve accuracy and speed of Positional Hashing. The goal of this article is to show that gapped kmer matching with cross-diagonal collation suffices for anchoring across close evolutionary distances and for the purpose of human resequencing. We propose a benchmark for evaluating the performance of anchoring programs that captures key parameters in specific applications, including duplicative structure of genomes of humans and other species. We demonstrate speedups of up to tenfold in large-scale anchoring experiments achieved by PASH 2.0 when compared to BLAT, another similarity search program frequently used for anchoring. |
Alternate Journal | Pac Symp Biocomput |
PubMed ID | 18229679 |
Grant List | 1R33CA114151-01A1 / CA / NCI NIH HHS / United States 5R01HG004009-02 / HG / NHGRI NIH HHS / United States |
Pash 2.0: scaleable sequence anchoring for next-generation sequencing technologies.
Similar Publications
Genetic diversity of 1,845 rhesus macaques improves genetic variation interpretation and identifies disease models. Nat Commun. 2024;15(1):5658. | .
PRL1 and PRL3 promote macropinocytosis via its lipid phosphatase activity. Theranostics. 2024;14(9):3423-3438. | .
A single cell RNA sequence atlas of the early Drosophila larval eye. BMC Genomics. 2024;25(1):616. | .