The strength of intron donor splice sites in human genes displays a bell-shaped pattern

Bioinformatics. 2011 Nov 15;27(22):3079-84. doi: 10.1093/bioinformatics/btr532. Epub 2011 Oct 12.

Abstract

Motivation: The gene concept has recently changed from the classical one protein notion into a much more diverse picture, where overlapping or fused transcripts, alternative transcription initiation, and genes within genes, add to the complexity generated by alternative splicing. Increased understanding of the mechanisms controlling pre-mRNA splicing is thus important for a wide range of aspects relating to gene expression.

Results: We have discovered a convex gene delineating pattern in the strength of 5' intron splice sites. When comparing the strengths of > 18,000 intron containing Human genes, we found that when analysing them separately according to the number of introns they contain, initial splice sites were always stronger on average than subsequent ones, and that a similar reversed trend exist towards the terminal gene part. The convex pattern is strongest for genes with up to 10 introns. Interestingly, when analysing the intron containing gene pool from mouse consisting of >15,000 genes, we found the convex pattern to be conserved despite > 75 million years of evolutionary divergence between the two organisms. We also analysed an interesting, novel class of chimeric genes which during spliceosome assembly are fused and in tandem are transcribed and spliced into a single mature mRNA sequence. In their splice site patterns, these genes individually seem to deviate from the convex pattern, offering a possible rationale behind their fusion into a single transcript.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Animals
  • Humans
  • Introns*
  • Mice
  • RNA Precursors / metabolism
  • RNA Splice Sites*
  • RNA, Messenger / metabolism

Substances

  • RNA Precursors
  • RNA Splice Sites
  • RNA, Messenger