Structural bioinformatics prediction of membrane-binding proteins

J Mol Biol. 2006 Jun 2;359(2):486-95. doi: 10.1016/j.jmb.2006.03.039. Epub 2006 Mar 30.

Abstract

Membrane-binding peripheral proteins play important roles in many biological processes, including cell signaling and membrane trafficking. Unlike integral membrane proteins, these proteins bind the membrane mostly in a reversible manner. Since peripheral proteins do not have canonical transmembrane segments, it is difficult to identify them from their amino acid sequences. As a first step toward genome-scale identification of membrane-binding peripheral proteins, we built a kernel-based machine learning protocol. Key features of known membrane-binding proteins, including electrostatic properties and amino acid composition, were calculated from their amino acid sequences and tertiary structures and then incorporated into a support vector machine to perform the classification. A data set of 40 membrane-binding proteins and 230 non-membrane-binding proteins was used to construct and validate the protocol. Cross-validation and holdout evaluation of the protocol showed that the accuracy of the prediction reached up to 93.7% and 91.6%, respectively. The protocol was applied to the prediction of membrane-binding properties of four C2 domains from novel protein kinases C. Although these C2 domains have 50% sequence identity, only one of them was predicted to bind the membrane, which was verified experimentally with surface plasmon resonance analysis. These results suggest that our protocol can be used for predicting membrane-binding properties of a wide variety of modular domains and may be further extended to genome-scale identification of membrane-binding peripheral proteins.
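
To make the feature-based, kernel-based idea in the abstract more concrete, the following is a minimal sketch of how sequence-derived features (amino acid composition and a crude charge estimate) might be computed and compared with a radial basis function kernel of the kind commonly used in support vector machines. It is not the authors' protocol: the function names, the charge model (+1 for Lys/Arg, -1 for Asp/Glu, histidine ignored), the choice of an RBF kernel, the gamma value, and the example sequences are all assumptions made for illustration, and the structure-derived electrostatic features described in the abstract are not modeled here.

```python
import math

# The 20 standard amino acids, in a fixed order used for the feature vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Return the fraction of each standard residue in seq (20 values)."""
    residues = [r for r in seq.upper() if r in AMINO_ACIDS]
    if not residues:
        raise ValueError("sequence contains no standard residues")
    n = len(residues)
    return [residues.count(aa) / n for aa in AMINO_ACIDS]


def net_charge(seq):
    """Crude net charge (assumption): +1 per K/R, -1 per D/E, His ignored."""
    s = seq.upper()
    positive = s.count("K") + s.count("R")
    negative = s.count("D") + s.count("E")
    return positive - negative


def features(seq):
    """Hypothetical feature vector: composition plus charge per residue."""
    comp = composition(seq)  # raises if there are no standard residues
    n = sum(1 for r in seq.upper() if r in AMINO_ACIDS)
    return comp + [net_charge(seq) / n]


def rbf_kernel(x, y, gamma=1.0):
    """RBF kernel exp(-gamma * ||x - y||^2) between two feature vectors."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


if __name__ == "__main__":
    # Made-up example sequences, not taken from the paper's data set.
    basic_rich = "MKKRLKKAGRKL"
    acidic_rich = "MDEELDDAGEEL"
    similarity = rbf_kernel(features(basic_rich), features(acidic_rich))
    print("charge per residue:", features(basic_rich)[-1], features(acidic_rich)[-1])
    print("RBF similarity:", similarity)
```

In a full pipeline, vectors like these would be passed to a support vector machine and the classifier's accuracy estimated by cross-validation and on a held-out set, as the abstract describes; those steps are omitted here to keep the sketch dependency-free.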

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Artificial Intelligence
  • Cell Membrane / metabolism
  • Computational Biology*
  • Databases, Protein
  • Membrane Proteins / chemistry*
  • Membrane Proteins / metabolism
  • Models, Molecular
  • Models, Theoretical
  • Protein Structure, Tertiary*
  • Reproducibility of Results
  • Sequence Analysis, Protein
  • Surface Properties

Substances

  • Membrane Proteins