A deep learning approach to evaluate the feasibility of enzymatic reactions generated by retrobiosynthesis

Biotechnol J. 2021 May;16(5):e2000605. doi: 10.1002/biot.202000605. Epub 2021 Jan 18.

Abstract

Retrobiosynthesis allows the designing of novel biosynthetic pathways for the production of chemicals and materials through metabolic engineering, but generates a large number of reactions beyond the experimental feasibility. Thus, an effective method that can reduce a large number of the initially predicted enzymatic reactions has been needed. Here, we present Deep learning-based Reaction Feasibility Checker (DeepRFC) to classify the feasibility of a given enzymatic reaction with high performance and speed. DeepRFC is designed to receive Simplified Molecular-Input Line-Entry System (SMILES) strings of a reactant pair, which is defined as a substrate and a product of a reaction, as an input, and evaluates whether the input reaction is feasible. A deep neural network is selected for DeepRFC as it leads to better classification performance than five other representative machine learning methods examined. For validation, the performance of DeepRFC is compared with another in-house reaction feasibility checker that uses the concept of reaction similarity. Finally, the use of DeepRFC is demonstrated for the retrobiosynthesis-based design of novel one-carbon assimilation pathways. DeepRFC will allow retrobiosynthesis to be more practical for metabolic engineering applications by efficiently screening a large number of retrobiosynthesis-derived enzymatic reactions. DeepRFC is freely available at https://bitbucket.org/kaistsystemsbiology/deeprfc.

Keywords: DeepRFC; deep learning; enzymatic reaction; reaction feasibility; retrobiosynthesis.

MeSH terms

  • Biosynthetic Pathways
  • Deep Learning*
  • Feasibility Studies
  • Metabolic Engineering
  • Neural Networks, Computer