ConsPred: a rule-based (re-)annotation framework for prokaryotic genomes

Bioinformatics. 2016 Nov 1;32(21):3327-3329. doi: 10.1093/bioinformatics/btw393. Epub 2016 Jul 4.

Abstract

Motivation: The rapidly growing number of available prokaryotic genome sequences requires fully automated and high-quality software solutions for their initial and re-annotation. Here we present ConsPred, a prokaryotic genome annotation framework that performs intrinsic gene predictions, homology searches, predictions of non-coding genes as well as CRISPR repeats and integrates all evidence into a consensus annotation. ConsPred achieves comprehensive, high-quality annotations based on rules and priorities, similar to decision-making in manual curation and avoids conflicting predictions. Parameters controlling the annotation process are configurable by the user. ConsPred has been used in the institutions of the authors for longer than 5 years and can easily be extended and adapted to specific needs.

Summary: The ConsPred algorithm for producing a consensus from the varying scores of multiple gene prediction programs approaches manual curation in accuracy. Its rule-based approach for choosing final predictions avoids overriding previous manual curations.

Availability and implementation: ConsPred is implemented in Java, Perl and Shell and is freely available under the Creative Commons license as a stand-alone in-house pipeline or as an Amazon Machine Image for cloud computing, see https://sourceforge.net/projects/conspred/.

Contact: thomas.rattei@univie.ac.atSupplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Algorithms
  • Genome*
  • Prokaryotic Cells*
  • Software*