StackTIS: a stacked generalization approach for effective prediction of translation initiation sites

Comput Biol Med. 2012 Jan;42(1):61-9. doi: 10.1016/j.compbiomed.2011.10.009. Epub 2011 Nov 12.

Abstract

The prediction of the translation initiation site in an mRNA or cDNA sequence is an essential step in gene prediction and an open research problem in bioinformatics. Although recent approaches perform well, more effective and reliable methodologies are solicited. We developed an adaptable data mining method, called StackTIS, which is modular and consists of three prediction components that are combined into a meta-classification system, using stacked generalization, in a highly effective framework. We performed extensive experiments on sequences of two diverse eukaryotic organisms (Homo sapiens and Oryza sativa), indicating that StackTIS achieves statistically significant improvement in performance.

MeSH terms

  • Bayes Theorem
  • Computational Biology / methods*
  • DNA, Complementary / genetics
  • Data Mining / methods*
  • Databases, Genetic
  • Humans
  • Oryza / genetics
  • Protein Biosynthesis*
  • RNA, Messenger / genetics
  • Sequence Analysis, DNA
  • Sequence Analysis, RNA
  • Support Vector Machine*
  • Transcription Initiation Site*

Substances

  • DNA, Complementary
  • RNA, Messenger