Macrocyclization of linear molecules by deep learning to facilitate macrocyclic drug candidates discovery

Nat Commun. 2023 Jul 28;14(1):4552. doi: 10.1038/s41467-023-40219-8.

Abstract

Interest in macrocycles as potential therapeutic agents has increased rapidly. Macrocyclization of bioactive acyclic molecules provides a potential avenue to yield novel chemical scaffolds, which can contribute to the improvement of the biological activity and physicochemical properties of these molecules. In this study, we propose a computational macrocyclization method based on Transformer architecture (which we name Macformer). Leveraging deep learning, Macformer explores the vast chemical space of macrocyclic analogues of a given acyclic molecule by adding diverse linkers compatible with the acyclic molecule. Macformer can efficiently learn the implicit relationships between acyclic and macrocyclic structures represented as SMILES strings and generate plenty of macrocycles with chemical diversity and structural novelty. In data augmentation scenarios using both internal ChEMBL and external ZINC test datasets, Macformer display excellent performance and generalisability. We showcase the utility of Macformer when combined with molecular docking simulations and wet lab based experimental validation, by applying it to the prospective design of macrocyclic JAK2 inhibitors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Drug Discovery / methods
  • Janus Kinase Inhibitors*
  • Macrocyclic Compounds* / chemistry
  • Macrocyclic Compounds* / pharmacology
  • Molecular Docking Simulation

Substances

  • Macrocyclic Compounds
  • Janus Kinase Inhibitors