Identifying remote protein homologs by network propagation

FEBS J. 2005 Oct;272(20):5119-28. doi: 10.1111/j.1742-4658.2005.04947.x.

Abstract

Perhaps the most widely used applications of bioinformatics are tools such as psi-blast for searching sequence databases. We describe a recently developed protein database search algorithm called rankprop. rankprop relies upon a precomputed network of pairwise protein similarities. The algorithm performs a diffusion operation from a specified query protein across the protein similarity network. The resulting activation scores, assigned to each database protein, encode information about the global structure of the protein similarity network. This type of algorithm has a rich history in associationist psychology, artificial intelligence and web search. We describe the rankprop algorithm and its relatives, and we provide evidence that the algorithm successfully improves upon the rankings produced by psi-blast.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Algorithms*
  • Bacterial Proteins / genetics
  • Computational Biology / methods*
  • Databases, Protein
  • Internet
  • Photoreceptors, Microbial / genetics
  • Protein Structure, Tertiary / genetics
  • Proteins / genetics
  • ROC Curve
  • Sequence Alignment / methods*

Substances

  • Bacterial Proteins
  • Photoreceptors, Microbial
  • Proteins
  • photoactive yellow protein, Bacteria