Annotation Extensions

Methods Mol Biol. 2017:1446:233-243. doi: 10.1007/978-1-4939-3743-1_17.

Abstract

The specificity of knowledge that Gene Ontology (GO) annotations currently can represent is still restricted by the legacy format of the GO annotation file, a format intentionally designed for simplicity to keep the barriers to entry low and thus encourage initial adoption. Historically, the information that could be captured in a GO annotation was simply the role or location of a gene product, although genetically interacting or binding partners could be specified. While there was no mechanism within the original GO annotation format for capturing additional information about the context of a GO term, such as the target gene of an activity or the location of a molecular function, the long-term vision for the GO Consortium was to provide greater expressivity in its annotations to capture physiologically relevant information.Thus, as a step forwards, the GO Consortium has introduced a new field into the annotation format, annotation extensions, which can be used to capture valuable contextual detail. This provides experimentally verified links between gene products and other physiological information that is crucial for accurate analysis of pathway and network data. This chapter will provide a simple overview of annotation extensions, illustrated with examples of their usage, and explain why they are useful for scientists and bioinformaticians alike.

Keywords: Analysis; Annotation; Annotation extension; Biocuration; Context; Gene Ontology; Network; Pathway.

MeSH terms

  • Animals
  • Computational Biology / methods
  • Databases, Genetic
  • Gene Ontology*
  • Humans
  • Molecular Sequence Annotation / methods*
  • Proteins / analysis
  • Proteins / genetics
  • Proteins / metabolism

Substances

  • Proteins