Differentiation between two-state and multi-state folding proteins based on sequence

Proteins. 2008 Jul;72(1):44-9. doi: 10.1002/prot.21893.

Abstract

Prediction of protein-folding rates follows different rules in two-state and multi-state kinetics. The prerequisite for the prediction is to recognize the folding kinetic pathway of proteins. Here, we use the logistic regression and support vector machine to discriminate between two-state and multi-state folding proteins. We find that chain length is sufficient to accurately recognize multi-state proteins. There is a transition boundary between two kinetic models. Protein folds with multi-state kinetics, if its length is larger than 112 residues. The logistic prediction from amino acid composition shows that the kinetic pathway of folding is closely related to amino acid volume. Small amino acids make two-state folding easier, and vice versa. However, cysteine, alanine, arginine, lysine, histidine, and methionine do not conform to this rule.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Protein Conformation
  • Protein Folding*
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Sequence Analysis, Protein*

Substances

  • Proteins