Toward Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-Based Convolutional Encoders

Matteo Manica; Ali Oskooei; Jannis Born; Vigneshwari Subramanian; Julio Sáez-Rodríguez; María Rodríguez Martínez

doi:10.1021/acs.molpharmaceut.9b00520

Toward Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-Based Convolutional Encoders

Mol Pharm. 2019 Dec 2;16(12):4797-4806. doi: 10.1021/acs.molpharmaceut.9b00520. Epub 2019 Oct 31.

Authors

Matteo Manica¹, Ali Oskooei¹, Jannis Born^{1

2

3}, Vigneshwari Subramanian⁴, Julio Sáez-Rodríguez⁵, María Rodríguez Martínez¹

Affiliations

¹ IBM Research, 8803 Zürich, Switzerland.
² ETH Zürich, 8092 Zürich, Switzerland.
³ University of Zürich, 8006 Zürich, Switzerland.
⁴ RWTH Aachen University, 52056 Aachen, Germany.
⁵ Heidelberg University, 69047 Heidelberg, Germany.

PMID: 31618586
DOI: 10.1021/acs.molpharmaceut.9b00520

Abstract

In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors, and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based encoder significantly outperforms a baseline model trained on Morgan fingerprints and a selection of encoders based on SMILES, as well as the previously reported state-of-the-art for multimodal drug sensitivity prediction (R² = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and the interpretability of our model testify to its potential for in silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design.

Keywords: CNN; EC50; GDSC; IC50; RNN; SMILES; anticancer compounds; attention; computational systems biology; deep learning; drug discovery; drug sensitivity; drug sensitivity prediction; explainability; gene expression; interpretability; lead discovery; machine learning; molecular fingerprints; molecular networks; multimodal; multiscale; personalized medicine; precision medicine.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Antineoplastic Agents
Deep Learning
Drug Design
Humans
Neural Networks, Computer

Substances

Antineoplastic Agents