De Novo Generation and Identification of Novel Compounds with Drug Efficacy Based on Machine Learning

Adv Sci (Weinh). 2024 Mar;11(11):e2307245. doi: 10.1002/advs.202307245. Epub 2024 Jan 10.

Abstract

One of the main challenges in small molecule drug discovery is finding novel chemical compounds with desirable activity. Traditional drug development typically begins with target selection, but the correlation between targets and disease remains to be further investigated, and drugs designed based on targets may not always have the desired drug efficacy. The emergence of machine learning provides a powerful tool to overcome the challenge. Herein, a machine learning-based strategy is developed for de novo generation of novel compounds with drug efficacy termed DTLS (Deep Transfer Learning-based Strategy) by using dataset of disease-direct-related activity as input. DTLS is applied in two kinds of disease: colorectal cancer (CRC) and Alzheimer's disease (AD). In each case, novel compound is discovered and identified in in vitro and in vivo disease models. Their mechanism of actionis further explored. The experimental results reveal that DTLS can not only realize the generation and identification of novel compounds with drug efficacy but also has the advantage of identifying compounds by focusing on protein targets to facilitate the mechanism study. This work highlights the significant impact of machine learning on the design of novel compounds with drug efficacy, which provides a powerful new approach to drug discovery.

Keywords: de novo design; drug efficacy; lead compound; machine learning.

MeSH terms

  • Drug Discovery* / methods
  • Machine Learning*
  • Proteins

Substances

  • Proteins