Title | Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. |
Publication Type | Journal Article |
Year of Publication | 2005 |
Authors | Siepel, A, Bejerano, G, Pedersen, JS, Hinrichs, AS, Hou, M, Rosenbloom, K, Clawson, H, Spieth, J, Hillier, LDW, Richards, S, Weinstock, GM, Wilson, RK, Gibbs, RA, Kent, WJ, Miller, W, Haussler, D |
Journal | Genome Res |
Volume | 15 |
Issue | 8 |
Pagination | 1034-50 |
Date Published | 2005 Aug |
ISSN | 1088-9051 |
Keywords | 3' Untranslated Regions, Animals, Base Pairing, Base Sequence, Caenorhabditis elegans, Conserved Sequence, DNA, Intergenic, Evolution, Molecular, Genome, Humans, Insecta, Molecular Sequence Data, Saccharomyces, Vertebrates, Yeasts |
Abstract | We have conducted a comprehensive search for conserved elements in vertebrate genomes, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes). Parallel searches have been performed with multiple alignments of four insect species (three species of Drosophila and Anopheles gambiae), two species of Caenorhabditis, and seven species of Saccharomyces. Conserved elements were identified with a computer program called phastCons, which is based on a two-state phylogenetic hidden Markov model (phylo-HMM). PhastCons works by fitting a phylo-HMM to the data by maximum likelihood, subject to constraints designed to calibrate the model across species groups, and then predicting conserved elements based on this model. The predicted elements cover roughly 3%-8% of the human genome (depending on the details of the calibration procedure) and substantially higher fractions of the more compact Drosophila melanogaster (37%-53%), Caenorhabditis elegans (18%-37%), and Saccharomyces cerevisiae (47%-68%) genomes. From yeasts to vertebrates, in order of increasing genome size and general biological complexity, increasing fractions of conserved bases are found to lie outside of the exons of known protein-coding genes. In all groups, the most highly conserved elements (HCEs), by log-odds score, are hundreds or thousands of bases long. These elements share certain properties with ultraconserved elements, but they tend to be longer and less perfectly conserved, and they overlap genes of somewhat different functional categories. In vertebrates, HCEs are associated with the 3' UTRs of regulatory genes, stable gene deserts, and megabase-sized regions rich in moderately conserved noncoding sequences. Noncoding HCEs also show strong statistical evidence of an enrichment for RNA secondary structure. |
DOI | 10.1101/gr.3715005 |
Alternate Journal | Genome Res |
PubMed ID | 16024819 |
PubMed Central ID | PMC1182216 |
Grant List | P41 HG002371 / HG / NHGRI NIH HHS / United States; R01 HG002238 / HG / NHGRI NIH HHS / United States; IP41HG02371 / HG / NHGRI NIH HHS / United States; HG02238 / HG / NHGRI NIH HHS / United States |