Composition-based effective chain length for prediction of protein folding rates

Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Nov;82(5 Pt 1):051930. doi: 10.1103/PhysRevE.82.051930. Epub 2010 Nov 24.

Abstract

Folding rate prediction is a useful way to find the key factors affecting folding kinetics of proteins. Structural information is more or less required in the present prediction methods, which limits the application of these methods to various proteins. In this work, an "effective length" is defined solely based on the composition of a protein, namely, the number of specific types of amino acids in a protein. A physical theory based on a minimalist model is employed to describe the relation between the folding rates and the effective length of proteins. Based on the resultant relationship between folding rates and effective length, the optimal sets of amino acids are found through the enumeration over all possible combinations of amino acids. This optimal set achieves a high correlation (with the coefficient of 0.84) between the folding rates and the optimal effective length. The features of these amino acids are consistent with our model and landscape theory. Further comparisons between our effective length and other factors are carried out. The effective length is physically consistent with structure-based prediction methods and has the best predictability for folding rates. These results all suggest that both entropy and energetics contribute importantly to folding kinetics. The ability to accurately and efficiently predict folding rates from composition enables the analysis of the kinetics for various kinds of proteins. The underlying physics in our method may be helpful to stimulate further understanding on the effects of various amino acids in folding dynamics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Kinetics
  • Models, Biological*
  • Protein Folding*
  • Proteins / chemistry*
  • Reproducibility of Results

Substances

  • Proteins