Machine learning models for classification tasks related to drug safety

Mol Divers. 2021 Aug;25(3):1409-1424. doi: 10.1007/s11030-021-10239-x. Epub 2021 Jun 10.

Abstract

In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015-2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood-brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts.

Keywords: ADMET; Big data; In silico modeling; Machine learning; QSAR; Toxicity.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Drug Design*
  • Drug-Related Side Effects and Adverse Reactions
  • ERG1 Potassium Channel / chemistry
  • ERG1 Potassium Channel / genetics
  • Humans
  • Machine Learning*
  • Models, Molecular*
  • Neural Networks, Computer
  • Pharmacokinetics
  • Quantitative Structure-Activity Relationship*
  • Support Vector Machine
  • Tissue Distribution

Substances

  • ERG1 Potassium Channel
  • KCNH2 protein, human