Title | Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis. |
Publication Type | Journal Article |
Year of Publication | 2012 |
Authors | Tu, Q, R Cameron, A, Worley, KC, Gibbs, RA, Davidson, EH |
Journal | Genome Res |
Volume | 22 |
Issue | 10 |
Pagination | 2079-87 |
Date Published | 2012 Oct |
ISSN | 1549-5469 |
Keywords | Animals, Computational Biology, Databases, Nucleic Acid, Exons, Female, Gene Expression Profiling, Genome, Introns, Male, Models, Genetic, Molecular Sequence Annotation, Molecular Sequence Data, Open Reading Frames, RNA, Spliced Leader, Strongylocentrotus purpuratus, Trans-Splicing, Transcriptome |
Abstract | A comprehensive transcriptome analysis has been performed on protein-coding RNAs of Strongylocentrotus purpuratus, including 10 different embryonic stages, six feeding larval and metamorphosed juvenile stages, and six adult tissues. In this study, we pooled the transcriptomes from all of these sources and focused on the insights they provide for gene structure in the genome of this recently sequenced model system. The genome had initially been annotated by use of computational gene model prediction algorithms. A large fraction of these predicted genes were recovered in the transcriptome when the reads were mapped to the genome and appropriately filtered and analyzed. However, in a manually curated subset, we discovered that more than half the computational gene model predictions were imperfect, containing errors such as missing exons, prediction of nonexistent exons, erroneous intron/exon boundaries, fusion of adjacent genes, and prediction of multiple genes from single genes. The transcriptome data have been used to provide a systematic upgrade of the gene model predictions throughout the genome, very greatly improving the research usability of the genomic sequence. We have constructed new public databases that incorporate information from the transcriptome analyses. The transcript-based gene model data were used to define average structural parameters for S. purpuratus protein-coding genes. In addition, we constructed a custom sea urchin gene ontology, and assigned about 7000 different annotated transcripts to 24 functional classes. Strong correlations became evident between given functional ontology classes and structural properties, including gene size, exon number, and exon and intron size. |
DOI | 10.1101/gr.139170.112 |
Alternate Journal | Genome Res |
PubMed ID | 22709795 |
PubMed Central ID | PMC3460201 |
Grant List | P40OD010959 / OD / NIH HHS / United States P40 OD010959 / OD / NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States P40RR015044 / RR / NCRR NIH HHS / United States P40 RR015044 / RR / NCRR NIH HHS / United States |
Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis.
Similar Publications
DNA Methylation-Derived Immune Cell Proportions and Cancer Risk in Black Participants. Cancer Res Commun. 2024;4(10):2714-2723. | .
Whole genomes of Amazonian uakari monkeys reveal complex connectivity and fast differentiation driven by high environmental dynamism. Commun Biol. 2024;7(1):1283. | .
Rare variant contribution to the heritability of coronary artery disease. Nat Commun. 2024;15(1):8741. | .