Sequence repeats and protein structure

Phys Rev E Stat Nonlin Soft Matter Phys. 2012 Nov;86(5 Pt 1):050901. doi: 10.1103/PhysRevE.86.050901. Epub 2012 Nov 8.

Abstract

Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / chemistry*
  • Computer Simulation
  • Models, Chemical*
  • Molecular Sequence Data
  • Protein Folding*
  • Proteins / chemistry*
  • Repetitive Sequences, Amino Acid*
  • Sequence Analysis, Protein / methods*
  • Structure-Activity Relationship
  • Temperature

Substances

  • Amino Acids
  • Proteins