Evaluating Computational Gene Ontology Annotations

Methods Mol Biol. 2017:1446:97-109. doi: 10.1007/978-1-4939-3743-1_8.

Abstract

Two avenues to understanding gene function are complementary and often overlapping: experimental work and computational prediction. While experimental annotation generally produces high-quality annotations, it is low throughput. Conversely, computational annotations have broad coverage, but the quality of annotations may be variable, and therefore evaluating the quality of computational annotations is a critical concern.In this chapter, we provide an overview of strategies to evaluate the quality of computational annotations. First, we discuss why evaluating quality in this setting is not trivial. We highlight the various issues that threaten to bias the evaluation of computational annotations, most of which stem from the incompleteness of biological databases. Second, we discuss solutions that address these issues, for example, targeted selection of new experimental annotations and leveraging the existing experimental annotations.

Keywords: Annotation; Evaluation; Function; Gene ontology; Prediction; Tools.

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Genetic
  • Gene Ontology*
  • Genome
  • Humans
  • Models, Biological
  • Molecular Sequence Annotation / methods*
  • Proteins / genetics
  • Proteins / metabolism

Substances

  • Proteins