Predicting combinative drug pairs via multiple classifier system with positive samples only

Comput Methods Programs Biomed. 2019 Jan:168:1-10. doi: 10.1016/j.cmpb.2018.11.002. Epub 2018 Nov 15.

Abstract

Background and objective: Due to the synergistic effects of drugs, drug combination is one of the effective approaches for treating complex diseases. However, the identification of drug combinations by dose-response methods is still costly. It is promising to develop supervised learning-based approaches to predict potential drug combinations on a large scale. Nevertheless, these approaches have the inadequate utilization of heterogeneous features, which causes the loss of information useful to classification. Moreover, they have an intrinsic bias, because they assume unknown drug pairs as non-combinations, of which some could be real drug combinations in practice.

Methods: To address above issues, this work first designs a two-layer multiple classifier system (TLMCS) to effectively integrate heterogeneous features involving anatomical therapeutic chemical codes of drugs, drug-drug interactions, drug-target interactions, gene ontology of drug targets, and side effects. To avoid the bias caused by labelling unknown samples as negative, it then utilizes the one-class support vector machines, (which requires no negative instance and only labels approved drug combinations as positive instances), as the member classifiers in TLMCS. Last, both a 10-fold cross validation (10-CV) and a novel prediction are performed to validate the performance of TLMCS.

Results: The comparison with three state-of-the-art approaches under 10-CV exhibits the superiority of TLMCS, which achieves the area under the receiver operating characteristic curve = 0.824 and the area under the precision-recall curve = 0.372. Moreover, the experiment under the novel prediction demonstrates its ability, where 9 out of the top-20 predicted combinative drug pairs are validated by checking the published literature. Furthermore, for each of the newly-validated drug combinations, this work analyses the combining mode of the member drugs and investigates their relationship in terms of drug targeting pathways.

Conclusions: The proposed TLMCS provides an effective framework to integrate those heterogeneous features and is trained by only positive samples such that the bias of taking unknown drug pairs as negative samples can be avoided. Furthermore, its results in the novel prediction reveal five types of drug combinations and three types of drug relationships in terms of pathways.

Keywords: Drug combination; Heterogeneous features; Multiple classifier system; One-class classification.

MeSH terms

  • Algorithms
  • Computational Biology
  • Computer Simulation
  • Databases, Factual
  • Drug Combinations*
  • Drug Evaluation, Preclinical / methods*
  • Drug Interactions*
  • Humans
  • Pharmaceutical Preparations / chemistry*
  • Pharmaceutical Preparations / classification*
  • Pharmacy / instrumentation*
  • Pharmacy / methods
  • ROC Curve
  • Software

Substances

  • Drug Combinations
  • Pharmaceutical Preparations