Molecular de-novo design through deep reinforcement learning

Marcus Olivecrona; Thomas Blaschke; Ola Engkvist; Hongming Chen

doi:10.1186/s13321-017-0235-x

Molecular de-novo design through deep reinforcement learning

J Cheminform. 2017 Sep 4;9(1):48. doi: 10.1186/s13321-017-0235-x.

Authors

Marcus Olivecrona¹, Thomas Blaschke², Ola Engkvist², Hongming Chen²

Affiliations

¹ Hit Discovery, Discovery Sciences, Innovative Medicines and Early Development Biotech Unit, AstraZeneca R&D Gothenburg, 43183, Mölndal, Sweden. m.olivecrona@gmail.com.
² Hit Discovery, Discovery Sciences, Innovative Medicines and Early Development Biotech Unit, AstraZeneca R&D Gothenburg, 43183, Mölndal, Sweden.

Abstract

This work introduces a method to tune a sequence-based generative model for molecular de novo design that through augmented episodic likelihood can learn to generate structures with certain specified desirable properties. We demonstrate how this model can execute a range of tasks such as generating analogues to a query structure and generating compounds predicted to be active against a biological target. As a proof of principle, the model is first trained to generate molecules that do not contain sulphur. As a second example, the model is trained to generate analogues to the drug Celecoxib, a technique that could be used for scaffold hopping or library expansion starting from a single molecule. Finally, when tuning the model towards generating compounds predicted to be active against the dopamine receptor type 2, the model generates structures of which more than 95% are predicted to be active, including experimentally confirmed actives that have not been included in either the generative model nor the activity prediction model. Graphical abstract .

Keywords: De novo design; Recurrent neural networks; Reinforcement learning.