Prediction of Anti-Glioblastoma Drug-Decorated Nanoparticle Delivery Systems Using Molecular Descriptors and Machine Learning

Int J Mol Sci. 2021 Oct 26;22(21):11519. doi: 10.3390/ijms222111519.

Abstract

The theoretical prediction of drug-decorated nanoparticles (DDNPs) has become a very important task in medical applications. For the current paper, Perturbation Theory Machine Learning (PTML) models were built to predict the probability of different pairs of drugs and nanoparticles creating DDNP complexes with anti-glioblastoma activity. PTML models use the perturbations of molecular descriptors of drugs and nanoparticles as inputs in experimental conditions. The raw dataset was obtained by mixing the nanoparticle experimental data with drug assays from the ChEMBL database. Ten types of machine learning methods have been tested. Only 41 features have been selected for 855,129 drug-nanoparticle complexes. The best model was obtained with the Bagging classifier, an ensemble meta-estimator based on 20 decision trees, with an area under the receiver operating characteristic curve (AUROC) of 0.96, and an accuracy of 87% (test subset). This model could be useful for the virtual screening of nanoparticle-drug complexes in glioblastoma. All the calculations can be reproduced with the datasets and python scripts, which are freely available as a GitHub repository from authors.

Keywords: ChEMBL database; anti-glioblastoma; big data; decorated nanoparticles; drug delivery; machine learning; perturbation theory.

MeSH terms

  • Antineoplastic Agents / administration & dosage*
  • Brain Neoplasms / drug therapy*
  • Databases, Chemical
  • Databases, Pharmaceutical
  • Drug Carriers / administration & dosage
  • Drug Delivery Systems*
  • Drug Design
  • Drug Screening Assays, Antitumor
  • Glioblastoma / drug therapy*
  • Humans
  • Machine Learning*
  • Nanoparticles* / administration & dosage
  • User-Computer Interface

Substances

  • Antineoplastic Agents
  • Drug Carriers