Assessing the relationship between conservation of function and conservation of sequence using photosynthetic proteins

Bioinformatics. 2012 Dec 15;28(24):3203-10. doi: 10.1093/bioinformatics/bts608. Epub 2012 Oct 18.

Abstract

Motivation: Assessing the false positive rate of function prediction methods is difficult, as it is hard to establish that a protein does not have a certain function. To determine to what extent proteins with similar sequences have a common function, we focused on photosynthesis-related proteins. A protein that comes from a non-photosynthetic organism is, undoubtedly, not involved in photosynthesis.

Results: We show that function diverges very rapidly: 70% of the close homologs of photosynthetic proteins come from non-photosynthetic organisms. Therefore, high sequence similarity, in most cases, is not tantamount to similar function. However, we found that many functionally similar proteins often share short sequence elements, which may correspond to a functional site and could reveal functional similarities more accurately than sequence similarity.

Conclusions: These results shed light on the way biological function is conserved in evolution and may help improve large-scale analysis of protein function.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Conserved Sequence*
  • Molecular Sequence Annotation
  • Photosynthesis*
  • Proteins / chemistry
  • Proteins / physiology
  • Sequence Homology, Amino Acid*

Substances

  • Proteins