Current methods for automated annotation of protein-coding genes

Curr Opin Insect Sci. 2015 Feb:7:8-14. doi: 10.1016/j.cois.2015.02.008. Epub 2015 Mar 7.

Abstract

We review software tools for gene prediction - the identification of protein-coding genes and their structure in genome sequences. The discussed approaches include methods based on RNA-Seq and current methods based on homology - comparative gene prediction and protein spliced alignments. Many methods require that their parameters are adjusted to the target species or its broader clade. These include ab initio gene finders, integrated approaches with ab initio components and some aligners. We also review current automatic methods for training for the common case that a bona fide training set of gene structures is not available before annotation.

Publication types

  • Review