SH3-SPOT: an algorithm to predict preferred ligands to different members of the SH3 gene family

J Mol Biol. 2000 Apr 28;298(2):313-28. doi: 10.1006/jmbi.2000.3670.

Abstract

We have developed a procedure to predict the peptide binding specificity of an SH3 domain from its sequence. The procedure utilizes information extracted from position-specific contacts derived from six SH3/peptide or SH3/protein complexes of known structure. The framework of SH3/peptide contacts defined on the structure of the complexes is used to build a residue-residue interaction database derived from ligands obtained by panning peptide libraries displayed on filamentous phage. The SH3-specific interaction database is a multidimensional array containing frequencies of position-specific contacts. As input, SH3-SPOT requires the sequence of an SH3 domain and of a query decapeptide ligand. The array, that we call the SH3-specific matrix, is then used to evaluate the probability that the peptide would bind the given SH3 domain. This procedure is fast enough to be applied to the entire protein sequence database. Panning experiments were performed to search putative specific ligands of different SH3 domains in a database of decapeptides, or in a database of protein sequences. The procedure ranked some of the natural partners of interaction of a number of SH3 domains among the best ligands of the approximately 5. 6x10(9) different decapeptides in the SWISSPROT database. We expect the predictive power of the method to increase with the enrichment of the SH3-specific matrix by interaction data derived from new complex structures or from the characterization of new ligands. The procedure was developed using the SH3 domain family as test case but its application can easily be extended to other families of protein domains (such as, SH2, MHC, EH, PDZ, etc.).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Animals
  • Binding Sites
  • Computational Biology / methods*
  • Databases, Factual
  • Humans
  • Ligands
  • Models, Molecular
  • Molecular Sequence Data
  • Multigene Family* / genetics
  • Peptide Fragments / chemistry
  • Peptide Fragments / genetics
  • Peptide Fragments / metabolism
  • Peptide Library
  • Probability
  • Protein Binding
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteins / metabolism*
  • Sequence Alignment
  • Substrate Specificity
  • src Homology Domains*

Substances

  • Ligands
  • Peptide Fragments
  • Peptide Library
  • Proteins