mGene.web: a web service for accurate computational gene finding

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W312-6. doi: 10.1093/nar/gkp479. Epub 2009 Jun 3.

Abstract

We describe mGene.web, a web service for the genome-wide prediction of protein coding genes from eukaryotic DNA sequences. It offers pre-trained models for the recognition of gene structures including untranslated regions in an increasing number of organisms. With mGene.web, users have the additional possibility to train the system with their own data for other organisms on the push of a button, a functionality that will greatly accelerate the annotation of newly sequenced genomes. The system is built in a highly modular way, such that individual components of the framework, like the promoter prediction tool or the splice site predictor, can be used autonomously. The underlying gene finding system mGene is based on discriminative machine learning techniques and its high accuracy has been demonstrated in an international competition on nematode genomes. mGene.web is available at http://www.mgene.org/web, it is free of charge and can be used for eukaryotic genomes of small to moderate size (several hundred Mbp).

MeSH terms

  • Genes*
  • Genomics*
  • Internet
  • Proteins / genetics*
  • RNA Splice Sites
  • Sequence Analysis, DNA
  • Software*
  • Transcription Initiation Site

Substances

  • Proteins
  • RNA Splice Sites