Ensemble learning application to discover new trypanothione synthetase inhibitors

Mol Divers. 2021 Aug;25(3):1361-1373. doi: 10.1007/s11030-021-10265-9. Epub 2021 Jul 15.

Abstract

Trypanosomatid-caused diseases are among the neglected infectious diseases with the highest disease burden, affecting about 27 million people worldwide, particularly socio-economically vulnerable populations. Trypanothione synthetase (TryS) is considered one of the most attractive drug targets within the thiol-polyamine metabolism of trypanosomatids, being unique, essential and druggable. Here, we have compiled a dataset of 401 T. brucei TryS inhibitors that includes compounds with inhibitory data reported in the literature as well as data acquired in-house. QSAR classifiers were derived from this dataset and validated using publicly available, open-source software, thus ensuring the portability of the resulting models. The performance and robustness of the models were substantially improved through ensemble learning. The performance of the individual models and the model ensembles was further assessed through retrospective virtual screening campaigns. Finally, as an application example, the chosen model ensemble was applied in a prospective virtual screening campaign on the DrugBank 5.1.6 compound library. All the in-house scripts used in this study are available on request, and the dataset is included as supplementary material.
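The abstract does not state how the individual classifiers were combined or scored, so the following is only a minimal sketch of the general idea: per-compound scores from several models are merged with a simple operator (mean or minimum here, both assumptions), and the ranking is evaluated with the area under the ROC curve, as in a retrospective screen. The function names, the toy labels and the toy scores are all hypothetical and are not taken from the study.

```python
from statistics import mean


def roc_auc(labels, scores):
    """Area under the ROC curve, computed as the fraction of
    (active, inactive) pairs ranked correctly; ties count as half."""
    actives = [s for y, s in zip(labels, scores) if y == 1]
    inactives = [s for y, s in zip(labels, scores) if y == 0]
    if not actives or not inactives:
        raise ValueError("need at least one active and one inactive compound")
    wins = 0.0
    for a in actives:
        for i in inactives:
            if a > i:
                wins += 1.0
            elif a == i:
                wins += 0.5
    return wins / (len(actives) * len(inactives))


def ensemble_scores(model_scores, combine=mean):
    """Combine the scores of several models compound by compound.

    model_scores: one list of scores per model, all in the same compound order.
    combine: operator applied to the scores of one compound (assumed: mean or min).
    """
    return [combine(per_compound) for per_compound in zip(*model_scores)]


# Hypothetical toy data: 1 = inhibitor, 0 = non-inhibitor.
labels = [1, 1, 1, 0, 0, 0]
model_scores = [
    [0.9, 0.4, 0.7, 0.5, 0.2, 0.1],
    [0.6, 0.8, 0.3, 0.2, 0.5, 0.1],
    [0.7, 0.6, 0.2, 0.1, 0.3, 0.6],
]

if __name__ == "__main__":
    for k, scores in enumerate(model_scores, start=1):
        print(f"model {k}: AUC = {roc_auc(labels, scores):.3f}")
    print(f"mean ensemble: AUC = {roc_auc(labels, ensemble_scores(model_scores)):.3f}")
    print(f"min ensemble:  AUC = {roc_auc(labels, ensemble_scores(model_scores, min)):.3f}")
```

On real data, the combination operator and the number of models in the ensemble would normally be chosen on a validation set rather than fixed in advance, and a toy example like this one says nothing about how much an ensemble helps in practice.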

Keywords: Chagas disease; Ensemble learning; Machine learning; QSAR; Trypanosoma cruzi; Trypanothione synthetase.

MeSH terms

  • Algorithms
  • Amide Synthases / antagonists & inhibitors
  • Amide Synthases / chemistry*
  • Amide Synthases / metabolism
  • Antiprotozoal Agents / chemistry
  • Antiprotozoal Agents / pharmacology
  • Databases, Pharmaceutical
  • Drug Discovery / methods*
  • Drug Evaluation, Preclinical / methods
  • Drug Evaluation, Preclinical / standards
  • Enzyme Inhibitors / chemistry*
  • Enzyme Inhibitors / pharmacology
  • Humans
  • Machine Learning*
  • Metabolic Networks and Pathways
  • Models, Theoretical
  • ROC Curve
  • Structure-Activity Relationship

Substances

  • Antiprotozoal Agents
  • Enzyme Inhibitors
  • Amide Synthases
  • trypanothione synthetase