Computational analysis of splicing errors and mutations in human transcripts

BMC Genomics. 2008 Jan 14:9:13. doi: 10.1186/1471-2164-9-13.

Abstract

Background: Most retained introns found in human cDNAs generated by high-throughput sequencing projects seem to result from underspliced transcripts, and thus they capture intermediate steps of pre-mRNA splicing. On the other hand, mutations in splice sites cause exon skipping of the respective exon or activation of pre-existing cryptic sites. Both types of events reflect properties of the splicing mechanism.

Results: The retained introns were significantly shorter than constitutive ones, and skipped exons are shorter than exons with cryptic sites. Both donor and acceptor splice sites of retained introns were weaker than splice sites of constitutive introns. The authentic acceptor sites affected by mutations were significantly weaker in exons with activated cryptic sites than in skipped exons. The distance from a mutated splice site to the nearest equivalent site is significantly shorter in cases of activated cryptic sites compared to exon skipping events. The prevalence of retained introns within genes monotonically increased in the 5'-to-3' direction (more retained introns close to the 3'-end), consistent with the model of co-transcriptional splicing. The density of exonic splicing enhancers was higher, and the density of exonic splicing silencers lower in retained introns compared to constitutive ones and in exons with cryptic sites compared to skipped exons.

Conclusion: Thus the analysis of retained introns in human cDNA, exons skipped due to mutations in splice sites and exons with cryptic sites produced results consistent with the intron definition mechanism of splicing of short introns, co-transcriptional splicing, dependence of splicing efficiency on the splice site strength and the density of candidate exonic splicing enhancers and silencers. These results are consistent with other, recently published analyses.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing / genetics
  • Computational Biology / statistics & numerical data
  • DNA Mutational Analysis
  • Databases, Nucleic Acid
  • Evolution, Molecular
  • Exons / genetics*
  • Genetic Variation
  • Genomics
  • Human Genome Project
  • Humans
  • Introns / genetics*
  • Models, Genetic
  • Mutation / genetics*
  • RNA Precursors
  • RNA Splice Sites / genetics*
  • RNA Splicing / genetics*
  • Regulatory Elements, Transcriptional / genetics
  • Transcription, Genetic / genetics*

Substances

  • RNA Precursors
  • RNA Splice Sites