Title | SVhound: detection of regions that harbor yet undetected structural variation. |
Publication Type | Journal Article |
Year of Publication | 2023 |
Authors | Paulin, LF, Raveendran, M, Harris, RA, Rogers, J, von Haeseler, A, Sedlazeck, FJ |
Journal | BMC Bioinformatics |
Volume | 24 |
Issue | 1 |
Pagination | 23 |
Date Published | 2023 Jan 20 |
ISSN | 1471-2105 |
Keywords | Alleles, Animals, Genome, Human, Genomic Structural Variation, High-Throughput Nucleotide Sequencing, Humans, Macaca mulatta, Polymorphism, Genetic, Software |
Abstract | BACKGROUND: Recent population studies are ever growing in number of samples to investigate the diversity of a population or species. These studies reveal new polymorphism that lead to important insights into the mechanisms of evolution, but are also important for the interpretation of these variations. Nevertheless, while the full catalog of variations across entire species remains unknown, we can predict which regions harbor additional not yet detected variations and investigate their properties, thereby enhancing the analysis for potentially missed variants. RESULTS: To achieve this we developed SVhound ( https://github.com/lfpaulin/SVhound ), which based on a population level SVs dataset can predict regions that harbor unseen SV alleles. We tested SVhound using subsets of the 1000 genomes project data and showed that its correlation (average correlation of 2800 tests r = 0.7136) is high to the full data set. Next, we utilized SVhound to investigate potentially missed or understudied regions across 1KGP and CCDG. Lastly we also apply SVhound on a small and novel SV call set for rhesus macaque (Macaca mulatta) and discuss the impact and choice of parameters for SVhound. CONCLUSIONS: SVhound is a unique method to identify potential regions that harbor hidden diversity in model and non model organisms and can also be potentially used to ensure high quality of SV call sets. |
DOI | 10.1186/s12859-022-05046-6 |
Alternate Journal | BMC Bioinformatics |
PubMed ID | 36670361 |
PubMed Central ID | PMC9854228 |
Grant List | P51 OD011104 / OD / NIH HHS / United States UM1 HG008898 / NH / NIH HHS / United States R24-OD-11173 / NH / NIH HHS / United States R24 OD011173 / OD / NIH HHS / United States UM1 HG008898 / HG / NHGRI NIH HHS / United States |
SVhound: detection of regions that harbor yet undetected structural variation.
Similar Publications
DNA Methylation-Derived Immune Cell Proportions and Cancer Risk in Black Participants. Cancer Res Commun. 2024;4(10):2714-2723. | .
Whole genomes of Amazonian uakari monkeys reveal complex connectivity and fast differentiation driven by high environmental dynamism. Commun Biol. 2024;7(1):1283. | .
StratoMod: predicting sequencing and variant calling errors with interpretable machine learning. Commun Biol. 2024;7(1):1316. | .