Rosetta design with co-evolutionary information retains protein function

PLoS Comput Biol. 2021 Jan 19;17(1):e1008568. doi: 10.1371/journal.pcbi.1008568. eCollection 2021 Jan.

Abstract

Computational protein design has the ambitious goal of crafting novel proteins that address challenges in biology and medicine. To overcome these challenges, the computational protein modeling suite Rosetta has been tailored to address various protein design tasks. Recently, statistical methods have been developed that identify correlated mutations between residues in a multiple sequence alignment of homologous proteins. These subtle inter-dependencies in the occupancy of residue positions throughout evolution are crucial for protein function, but we found that three current Rosetta design approaches fail to recover these co-evolutionary couplings. Thus, we developed the Rosetta method ResCue (residue-coupling enhanced) that leverages co-evolutionary information to favor sequences which recapitulate correlated mutations, as observed in nature. To assess the protocols via recapitulation designs, we compiled a benchmark of ten proteins each represented by two, structurally diverse states. We could demonstrate that ResCue designed sequences with an average sequence recovery rate of 70%, whereas three other protocols reached not more than 50%, on average. Our approach had higher recovery rates also for functionally important residues, which were studied in detail. This improvement has only a minor negative effect on the fitness of the designed sequences as assessed by Rosetta energy. In conclusion, our findings support the idea that informing protocols with co-evolutionary signals helps to design stable and native-like proteins that are compatible with the different conformational states required for a complex function.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Amino Acids / metabolism
  • Amino Acids / physiology
  • Computational Biology / methods*
  • Conserved Sequence
  • Evolution, Molecular*
  • Models, Molecular
  • Protein Conformation*
  • Protein Domains / physiology
  • Proteins* / chemistry
  • Proteins* / metabolism
  • Proteins* / physiology
  • Sequence Alignment / methods*
  • Sinorhizobium meliloti
  • Thermodynamics

Substances

  • Amino Acids
  • Proteins