Protein design under competing conditions for the availability of amino acids

Sci Rep. 2020 Feb 14;10(1):2684. doi: 10.1038/s41598-020-59401-9.

Abstract

Isolating the properties of proteins that allow them to convert sequence into the structure is a long-lasting biophysical problem. In particular, studies focused extensively on the effect of a reduced alphabet size on the folding properties. However, the natural alphabet is a compromise between versatility and optimisation of the available resources. Here, for the first time, we include the impact of the relative availability of the amino acids to extract from the 20 letters the core necessary for protein stability. We present a computational protein design scheme that involves the competition for resources between a protein and a potential interaction partner that, additionally, gives us the chance to investigate the effect of the reduced alphabet on protein-protein interactions. We devise a scheme that automatically identifies the optimal reduced set of letters for the design of the protein, and we observe that even alphabets reduced down to 4 letters allow for single protein folding. However, it is only with 6 letters that we achieve optimal folding, thus recovering experimental observations. Additionally, we notice that the binding between the protein and a potential interaction partner could not be avoided with the investigated reduced alphabets. Therefore, we suggest that aggregation could have been a driving force in the evolution of the large protein alphabet.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amines / chemistry
  • Amino Acid Sequence / genetics
  • Amino Acids
  • Computational Biology*
  • Protein Conformation*
  • Protein Folding*
  • Proteins / genetics
  • Proteins / ultrastructure*
  • Sequence Analysis, Protein

Substances

  • Amines
  • Amino Acids
  • Proteins