Geometric and statistical properties of the mean-field hydrophobic-polar model, the large-small model, and real protein sequences

Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Apr;65(4 Pt 1):041923. doi: 10.1103/PhysRevE.65.041923. Epub 2002 Apr 11.

Abstract

Lattice models, for their coarse-grained nature, are best suited for the study of the "designability problem," the phenomenon in which most of the about 16 000 proteins of known structure have their native conformations concentrated in a relatively small number of about 500 topological classes of conformations. Here it is shown that on a lattice the most highly designable simulated protein structures are those that have the largest number of surface-core switchbacks. A combination of physical, mathematical, and biological reasons that causes the phenomenon is given. By comparing the most foldable model peptides with protein sequences in the Protein Data Bank, it is shown that whereas different models may yield similar designabilities, predicted foldable peptides will simulate natural proteins only when the model incorporates the correct physics and biology, in this case if the main folding force arises from the differing hydrophobicity of the residues, but does not originate, say, from the steric hindrance effect caused by the differing sizes of the residues.

Publication types

  • Comparative Study
  • Multicenter Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence*
  • Computational Biology* / methods
  • Computational Biology* / statistics & numerical data
  • Databases, Protein
  • Hydrophobic and Hydrophilic Interactions*
  • Models, Biological*
  • Models, Chemical*
  • Models, Statistical*
  • Protein Conformation
  • Protein Folding
  • Proteins / chemistry*

Substances

  • Proteins