Protein Function Prediction

Methods Mol Biol. 2017:1654:55-75. doi: 10.1007/978-1-4939-7231-9_5.

Abstract

Protein function is a concept that can have different interpretations in different biological contexts, and the number and diversity of novel proteins identified by large-scale "omics" technologies poses increasingly new challenges. In this review we explore current strategies used to predict protein function focused on high-throughput sequence analysis, as for example, inference based on sequence similarity, sequence composition, structure, and protein-protein interaction. Various prediction strategies are discussed together with illustrative workflows highlighting the use of some benchmark tools and knowledge bases in the field.

Keywords: Bioinformatics; Biological databases; Database sequence similarity search; Homology; Ontology; Phylogeny; Protein domains; Protein families; Protein function.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein
  • Phylogeny
  • Proteins / chemistry*
  • Proteins / classification
  • Sequence Alignment
  • Sequence Analysis, Protein
  • Software*

Substances

  • Proteins