Biosystems Design by Machine Learning

ACS Synth Biol. 2020 Jul 17;9(7):1514-1533. doi: 10.1021/acssynbio.0c00129. Epub 2020 Jun 29.

Abstract

Biosystems such as enzymes, pathways, and whole cells have been increasingly explored for biotechnological applications. However, the intricate connectivity and resulting complexity of biosystems poses a major hurdle in designing biosystems with desirable features. As -omics and other high throughput technologies have been rapidly developed, the promise of applying machine learning (ML) techniques in biosystems design has started to become a reality. ML models enable the identification of patterns within complicated biological data across multiple scales of analysis and can augment biosystems design applications by predicting new candidates for optimized performance. ML is being used at every stage of biosystems design to help find nonobvious engineering solutions with fewer design iterations. In this review, we first describe commonly used models and modeling paradigms within ML. We then discuss some applications of these models that have already shown success in biotechnological applications. Moreover, we discuss successful applications at all scales of biosystems design, including nucleic acids, genetic circuits, proteins, pathways, genomes, and bioprocesses. Finally, we discuss some limitations of these methods and potential solutions as well as prospects of the combination of ML and biosystems design.

Keywords: biosystems design; machine learning; metabolic engineering; synthetic biology.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Biotechnology*
  • Gene Editing
  • Gene Regulatory Networks
  • Linear Models
  • Machine Learning*
  • Metabolic Engineering
  • Proteins* / chemistry
  • Proteins* / metabolism

Substances

  • Proteins