Functional annotation by identification of local surface similarities: a novel tool for structural genomics

BMC Bioinformatics. 2005 Aug 2:6:194. doi: 10.1186/1471-2105-6-194.

Abstract

Background: Protein function is often dependent on subsets of solvent-exposed residues that may exist in a similar three-dimensional configuration in non homologous proteins thus having different order and/or spacing in the sequence. Hence, functional annotation by means of sequence or fold similarity is not adequate for such cases.

Results: We describe a method for the function-related annotation of protein structures by means of the detection of local structural similarity with a library of annotated functional sites. An automatic procedure was used to annotate the function of local surface regions. Next, we employed a sequence-independent algorithm to compare exhaustively these functional patches with a larger collection of protein surface cavities. After tuning and validating the algorithm on a dataset of well annotated structures, we applied it to a list of protein structures that are classified as being of unknown function in the Protein Data Bank. By this strategy, we were able to provide functional clues to proteins that do not show any significant sequence or global structural similarity with proteins in the current databases.

Conclusion: This method is able to spot structural similarities associated to function-related similarities, independently on sequence or fold resemblance, therefore is a valuable tool for the functional analysis of uncharacterized proteins. Results are available at http://cbm.bio.uniroma2.it/surface/structuralGenomics.html.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Binding Sites
  • Computational Biology*
  • Databases, Protein*
  • False Negative Reactions
  • Genomics / methods
  • Information Storage and Retrieval / methods
  • Internet
  • Molecular Sequence Data
  • Pattern Recognition, Automated
  • Proteins / chemistry*
  • Proteins / classification*
  • Proteins / metabolism
  • Sequence Analysis, Protein / methods*

Substances

  • Proteins