Syntactic pattern analysis of 5'-splice site sequences of mRNA precursors in higher eukaryote genes

Comput Appl Biosci. 1987 Nov;3(4):319-24. doi: 10.1093/bioinformatics/3.4.319.

Abstract

The signals that direct the excision of introns from eukaryotic pre-mRNA are not yet well understood. To define the signals for 5'-splice sites in mRNA splicing, we analyse nucleotide sequences spanning the 5'-splice junctions of mammalian pre-mRNAs by means of syntactic pattern analysis. With this approach, we infer the grammatical rules that specify 5'-splice sites and construct a finite automaton that recognizes the nucleotide sequences at these sites. By scanning nucleotide sequences with the automaton, we can identify the positions of 5'-splice junctions with a degree of discrimination of up to 94-97% in known genes, while the degree of prediction is in the range 50-55% in new genes.
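
The idea of scanning a sequence with a finite automaton can be illustrated with a minimal sketch. The abstract does not give the inferred grammar, so the pattern below is an assumption: a simplified form of the commonly cited mammalian 5'-splice site consensus (A or C, then AG, followed by an intron starting with GT, a purine, and AG). The names `ALLOWED`, `EXON_LEN`, `accepts_at`, and `find_junctions` are hypothetical and are not taken from the paper.

```python
# Assumed, simplified consensus around the junction: [A/C] A G | G T [A/G] A G
# This is NOT the grammar inferred in the paper; it only illustrates the technique.
ALLOWED = [set("AC"), set("A"), set("G"),               # last three exon bases
           set("G"), set("T"), set("AG"), set("A"), set("G")]  # first five intron bases
EXON_LEN = 3  # pattern positions that lie upstream of the junction


def build_transitions(allowed):
    """Linear automaton: state i moves to state i + 1 on any base allowed at position i."""
    return {(i, base): i + 1 for i, bases in enumerate(allowed) for base in bases}


TRANSITIONS = build_transitions(ALLOWED)
ACCEPT = len(ALLOWED)  # the single accepting state


def accepts_at(seq, start):
    """Run the automaton from state 0 on the window of seq beginning at index start."""
    state = 0
    for base in seq[start:start + ACCEPT]:
        state = TRANSITIONS.get((state, base))
        if state is None:  # no transition defined: reject this window
            return False
    return state == ACCEPT


def find_junctions(seq):
    """Return 0-based indices of the first intron base for every accepted window."""
    seq = seq.upper()
    return [start + EXON_LEN
            for start in range(len(seq) - ACCEPT + 1)
            if accepts_at(seq, start)]


if __name__ == "__main__":
    print(find_junctions("TTTCAGGTAAGTCCC"))
```

A strict pattern of this kind either accepts or rejects each window; it says nothing about how the discrimination and prediction figures reported in the abstract were measured.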

MeSH terms

  • Algorithms
  • Animals
  • Base Sequence
  • Eukaryotic Cells
  • Humans
  • Introns
  • Pattern Recognition, Automated*
  • RNA Precursors*
  • RNA Splicing*

Substances

  • RNA Precursors