De Novo Drug Design Using Transformer-Based Machine Translation and Reinforcement Learning of an Adaptive Monte Carlo Tree Search

Dony Ang; Cyril Rakovski; Hagop S Atamian

doi:10.3390/ph17020161

Pharmaceuticals (Basel). 2024 Jan 27;17(2):161. doi: 10.3390/ph17020161.

Authors

Dony Ang¹ ², Cyril Rakovski¹ ², Hagop S Atamian² ³

Affiliations

¹ Computational and Data Sciences Program, Chapman University, Orange, CA 92866, USA.
² Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
³ Biological Sciences Program, Chapman University, Orange, CA 92866, USA.

Abstract

The discovery of novel therapeutic compounds through de novo drug design is a critical challenge in pharmaceutical research. Traditional drug discovery approaches are often resource-intensive and time-consuming, leading researchers to explore methods that harness deep learning and reinforcement learning. Here, we introduce a drug design approach called drugAI that combines the Encoder-Decoder Transformer architecture with Reinforcement Learning via a Monte Carlo Tree Search (RL-MCTS) to expedite drug discovery while ensuring the production of valid small molecules with drug-like characteristics and strong binding affinities towards their targets. We integrated the Encoder-Decoder Transformer architecture, which generates molecular structures (drugs) from scratch, with the RL-MCTS, which serves as the reinforcement learning framework. The RL-MCTS combines the exploitation and exploration capabilities of a Monte Carlo Tree Search with the machine translation of a transformer-based Encoder-Decoder model. This dynamic approach allows the model to iteratively refine its drug candidate generation process so that the generated molecules adhere to essential physicochemical and biological constraints and effectively bind to their targets. Across various benchmark datasets, drugAI showed a significant improvement in both the validity and drug-likeness of the generated compounds compared to two existing benchmark methods. Moreover, the molecules generated by drugAI exhibit strong binding affinities to their respective targets. In summary, this research highlights the real-world applications of drugAI in drug discovery pipelines, potentially accelerating the identification of promising drug candidates for a wide range of diseases.

Keywords: artificial intelligence; drug design; encoder–decoder; molecular docking; novel molecules; quantitative estimate of drug-likeness (QED); reinforcement learning; transformer; validity; virtual screening.
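
The abstract describes the RL-MCTS as balancing exploitation and exploration but does not state the selection rule. As an illustration only, the sketch below uses the standard UCB1 rule, which is a common choice in Monte Carlo Tree Search; it is an assumption here, not the authors' implementation. The names `Node`, `ucb1`, `select_child`, and `backpropagate`, the exploration constant `c`, and the idea that each node holds one candidate token are all hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """One tree node; `token` stands in for a candidate next symbol (assumed)."""
    token: str
    visits: int = 0
    total_reward: float = 0.0
    children: list = field(default_factory=list)


def ucb1(parent_visits: int, child: Node, c: float = math.sqrt(2)) -> float:
    """Standard UCB1 score: mean reward plus an exploration bonus."""
    if child.visits == 0:
        return math.inf  # unvisited children are tried first
    exploit = child.total_reward / child.visits
    explore = c * math.sqrt(math.log(parent_visits) / child.visits)
    return exploit + explore


def select_child(parent: Node, c: float = math.sqrt(2)) -> Node:
    """Pick the child with the highest UCB1 score."""
    return max(parent.children, key=lambda ch: ucb1(parent.visits, ch, c))


def backpropagate(path: list, reward: float) -> None:
    """Add a rollout reward to every node on the visited path."""
    for node in path:
        node.visits += 1
        node.total_reward += reward
```

In a setup like the one the abstract outlines, the reward passed to `backpropagate` could be a score such as validity, QED, or predicted binding affinity of the completed molecule; how drugAI actually computes and combines these is not specified in this record. Setting `c=0` makes selection purely greedy, while larger values favor rarely visited branches.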

Grants and funding

KF37/Research award from the Kay Family Foundation to H.S.A.