Exploring the repeat protein universe through computational protein design

Nature. 2015 Dec 24;528(7583):580-4. doi: 10.1038/nature16162. Epub 2015 Dec 16.

Abstract

A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. Here we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix-loop-helix-loop structural motif. Eighty-three designs with sequences unrelated to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.
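The agreement between crystal structures and design models is reported above as a root mean square deviation (RMSD). As a rough illustration of what that number measures, the following is a minimal sketch, not the authors' procedure. It assumes the two coordinate sets are already superimposed and that atoms are paired by index; the abstract does not state which atoms were compared or how the structures were aligned. The function name `rmsd` and the toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Return the RMSD between two equal-length lists of (x, y, z) points.

    Assumption: the two sets are already superimposed and atom i in
    coords_a corresponds to atom i in coords_b. No optimal alignment
    (e.g. a Kabsch fit) is performed here.
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate sets must have the same length")
    if not coords_a:
        raise ValueError("coordinate sets must not be empty")

    # Sum of squared distances between paired atoms.
    total = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        total += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2

    # Mean over atoms, then square root.
    return math.sqrt(total / len(coords_a))


# Toy coordinates in angstroms, invented for illustration only;
# they are not taken from any of the deposited PDB entries.
design_model = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
crystal = [(0.0, 1.0, 0.0), (3.8, 1.0, 0.0), (7.6, 1.0, 0.0)]

print(f"RMSD = {rmsd(design_model, crystal):.2f} Å")
```

In this toy case every atom is displaced by exactly 1 Å, so the RMSD is 1 Å. In practice the structures would first be optimally superimposed, and the value depends on which atoms are included in the comparison.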

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Motifs*
  • Amino Acid Sequence
  • Bioengineering*
  • Computer Simulation*
  • Crystallography, X-Ray
  • Models, Molecular
  • Protein Conformation*
  • Protein Folding
  • Protein Stability
  • Proteins / chemistry*
  • Temperature

Substances

  • Proteins

Associated data

  • PDB/5CWB
  • PDB/5CWC
  • PDB/5CWD
  • PDB/5CWF
  • PDB/5CWG
  • PDB/5CWH
  • PDB/5CWI
  • PDB/5CWJ
  • PDB/5CWK
  • PDB/5CWL
  • PDB/5CWM
  • PDB/5CWN
  • PDB/5CWO
  • PDB/5CWP
  • PDB/5CWQ