Computational planning of the synthesis of complex natural products

Nature. 2020 Dec;588(7836):83-88. doi: 10.1038/s41586-020-2855-y. Epub 2020 Oct 13.

Abstract

Training algorithms to computationally plan multistep organic syntheses has been a challenge for more than 50 years1-7. However, the field has progressed greatly since the development of early programs such as LHASA1,7, for which reaction choices at each step were made by human operators. Multiple software platforms6,8-14 are now capable of completely autonomous planning. But these programs 'think' only one step at a time and have so far been limited to relatively simple targets, the syntheses of which could arguably be designed by human chemists within minutes, without the help of a computer. Furthermore, no algorithm has yet been able to design plausible routes to complex natural products, for which much more far-sighted, multistep planning is necessary15,16 and closely related literature precedents cannot be relied on. Here we demonstrate that such computational synthesis planning is possible, provided that the program's knowledge of organic chemistry and data-based artificial intelligence routines are augmented with causal relationships17,18, allowing it to 'strategize' over multiple synthetic steps. Using a Turing-like test administered to synthesis experts, we show that the routes designed by such a program are largely indistinguishable from those designed by humans. We also successfully validated three computer-designed syntheses of natural products in the laboratory. Taken together, these results indicate that expert-level automated synthetic planning is feasible, pending continued improvements to the reaction knowledge base and further code optimization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence* / standards
  • Automation / methods
  • Automation / standards
  • Benzylisoquinolines / chemical synthesis
  • Benzylisoquinolines / chemistry
  • Biological Products / chemical synthesis*
  • Chemistry Techniques, Synthetic / methods*
  • Chemistry Techniques, Synthetic / standards
  • Chemistry, Organic / methods*
  • Chemistry, Organic / standards
  • Indans / chemical synthesis
  • Indans / chemistry
  • Indole Alkaloids / chemical synthesis
  • Indole Alkaloids / chemistry
  • Knowledge Bases
  • Lactones / chemical synthesis
  • Lactones / chemistry
  • Macrolides / chemical synthesis
  • Macrolides / chemistry
  • Reproducibility of Results
  • Sesquiterpenes / chemical synthesis
  • Sesquiterpenes / chemistry
  • Software* / standards
  • Tetrahydroisoquinolines / chemical synthesis
  • Tetrahydroisoquinolines / chemistry

Substances

  • Benzylisoquinolines
  • Biological Products
  • Indans
  • Indole Alkaloids
  • Lactones
  • Macrolides
  • Sesquiterpenes
  • Tetrahydroisoquinolines
  • aplykurodinone-1
  • callyspongiolide
  • lamellodysidine A
  • tacamonine
  • dauricine