Prediction of synergistic drug combinations using PCA-initialized deep learning

Jun Ma; Alison Motsinger-Reif

doi:10.1186/s13040-021-00278-3

Prediction of synergistic drug combinations using PCA-initialized deep learning

BioData Min. 2021 Oct 20;14(1):46. doi: 10.1186/s13040-021-00278-3.

Authors

Jun Ma^{1

2}, Alison Motsinger-Reif³

Affiliations

¹ Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.
² Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Drive, Durham, NC, 27709, USA.
³ Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Drive, Durham, NC, 27709, USA. alison.motsinger-reif@nih.gov.

Abstract

Background: Cancer is one of the main causes of death worldwide. Combination drug therapy has been a mainstay of cancer treatment for decades and has been shown to reduce host toxicity and prevent the development of acquired drug resistance. However, the immense number of possible drug combinations and large synergistic space makes it infeasible to screen all effective drug pairs experimentally. Therefore, it is crucial to develop computational approaches to predict drug synergy and guide experimental design for the discovery of rational combinations for therapy.

Results: We present a new deep learning approach to predict synergistic drug combinations by integrating gene expression profiles from cell lines and chemical structure data. Specifically, we use principal component analysis (PCA) to reduce the dimensionality of the chemical descriptor data and gene expression data. We then propagate the low-dimensional data through a neural network to predict drug synergy values. We apply our method to O'Neil's high-throughput drug combination screening data as well as a dataset from the AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge. We compare the neural network approach with and without dimension reduction. Additionally, we demonstrate the effectiveness of our deep learning approach and compare its performance with three state-of-the-art machine learning methods: Random Forests, XGBoost, and elastic net, with and without PCA-based dimensionality reduction.

Conclusions: Our developed approach outperforms other machine learning methods, and the use of dimension reduction dramatically decreases the computation time without sacrificing accuracy.

Keywords: Cancer treatment; Deep learning; Drug combination treatment; Elastic net; Feedforward neural network; Machine learning; Random Forests; XGBoost.

Abstract

Grants and funding