Detection and evaluation of intron retention events in the human transcriptome
Open Access
- 20 April 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in RNA
- Vol. 10 (5), 757-765
- https://doi.org/10.1261/rna.5123504
Abstract
Alternative splicing is a very frequent phenomenon in the human transcriptome. There are four major types of alternative splicing: exon skipping, alternative 3′ splice site, alternative 5′ splice site, and intron retention. Here we present a large-scale analysis of intron retention in a set of 21,106 known human genes. We observed that 14.8% of these genes showed evidence of at least one intron retention event. Most of the events are located within the untranslated regions (UTRs) of human transcripts. For those retained introns interrupting the coding region, the GC content, codon usage, and the frequency of stop codons suggest that these sequences are under selection for coding potential. Furthermore, 26% of the introns within the coding region participate in the coding of a protein domain. A comparison with mouse shows that at least 22% of all informative examples of retained introns in human are also present in the mouse transcriptome. We discuss that the data we present suggest that a significant fraction of the observed events is not spurious and might reflect biological significance. The analyses also allowed us to generate a reliable set of intron retention events that can be used for the identification of splicing regulatory elements.Keywords
This publication has 32 references indexed in Scilit:
- Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAsNature, 2002
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Selecting for Functional Alternative Splices in ESTsGenome Research, 2002
- Splice Variation in Mouse Full-Length cDNAs Identified by Mapping to the Mouse GenomeGenome Research, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- An Alternative-Exon Database and Its Statistical AnalysisDNA and Cell Biology, 2000
- A Greedy Algorithm for Aligning DNA SequencesJournal of Computational Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- dbEST — database for “expressed sequence tags”Nature Genetics, 1993
- Regulation of Drosophila P element transpositionTrends in Genetics, 1991