Machine Learning Enables Selection of Epistatic Enzyme Mutants for Stability Against Unfolding and Detrimental Aggregation

Chembiochem. 2021 Mar 2;22(5):904-914. doi: 10.1002/cbic.202000612. Epub 2020 Nov 17.

Abstract

Machine learning (ML) has pervaded most areas of protein engineering, including stability and stereoselectivity. Using limonene epoxide hydrolase as the model enzyme and innov'SAR as the ML platform, comprising a digital signal process, we achieved high protein robustness that can resist unfolding with concomitant detrimental aggregation. Fourier transform (FT) allows us to take into account the order of the protein sequence and the nonlinear interactions between positions, and thus to grasp epistatic phenomena. The innov'SAR approach is interpolative, extrapolative and makes outside-the-box, predictions not found in other state-of-the-art ML or deep learning approaches. Equally significant is the finding that our approach to ML in the present context, flanked by advanced molecular dynamics simulations, uncovers the connection between epistatic mutational interactions and protein robustness.

Keywords: artificial intelligence; epistasis; epoxide hydrolase; innov'SAR; machine learning; molecular dynamics simulations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Epoxide Hydrolases / chemistry*
  • Epoxide Hydrolases / genetics
  • Epoxide Hydrolases / metabolism*
  • Limonene / chemistry
  • Limonene / metabolism
  • Machine Learning*
  • Molecular Dynamics Simulation
  • Mutation*
  • Protein Engineering
  • Protein Folding*
  • Protein Multimerization*
  • Rhodococcus / enzymology*

Substances

  • Limonene
  • Epoxide Hydrolases