DTITR: End-to-end drug-target binding affinity prediction with transformers

Nelson R C Monteiro; José L Oliveira; Joel P Arrais

doi:10.1016/j.compbiomed.2022.105772

DTITR: End-to-end drug-target binding affinity prediction with transformers

Comput Biol Med. 2022 Aug:147:105772. doi: 10.1016/j.compbiomed.2022.105772. Epub 2022 Jun 21.

Authors

Nelson R C Monteiro¹, José L Oliveira², Joel P Arrais³

Affiliations

¹ Univ Coimbra, Centre for Informatics and Systems of the University of Coimbra, Department of Informatics Engineering, Coimbra, Portugal. Electronic address: nelsonrcm@dei.uc.pt.
² IEETA, Department of Electronics, Telecommunications and Informatics, University of Aveiro, Aveiro, Portugal. Electronic address: jlo@ua.pt.
³ Univ Coimbra, Centre for Informatics and Systems of the University of Coimbra, Department of Informatics Engineering, Coimbra, Portugal. Electronic address: jpa@dei.uc.pt.

PMID: 35777085
DOI: 10.1016/j.compbiomed.2022.105772

Abstract

The accurate identification of Drug-Target Interactions (DTIs) remains a critical turning point in drug discovery and understanding of the binding process. Despite recent advances in computational solutions to overcome the challenges of in vitro and in vivo experiments, most of the proposed in silico-based methods still focus on binary classification, overlooking the importance of characterizing DTIs with unbiased binding strength values to properly distinguish primary interactions from those with off-targets. Moreover, several of these methods usually simplify the entire interaction mechanism, neglecting the joint contribution of the individual units of each binding component and the interacting substructures involved, and have yet to focus on more explainable and interpretable architectures. In this study, we propose an end-to-end Transformer-based architecture for predicting drug-target binding affinity (DTA) using 1D raw sequential and structural data to represent the proteins and compounds. This architecture exploits self-attention layers to capture the biological and chemical context of the proteins and compounds, respectively, and cross-attention layers to exchange information and capture the pharmacological context of the DTIs. The results show that the proposed architecture is effective in predicting DTA, achieving superior performance in both correctly predicting the value of interaction strength and being able to correctly discriminate the rank order of binding strength compared to state-of-the-art baselines. The combination of multiple Transformer-Encoders was found to result in robust and discriminative aggregate representations of the proteins and compounds for binding affinity prediction, in which the addition of a Cross-Attention Transformer-Encoder was identified as an important block for improving the discriminative power of these representations. Overall, this research study validates the applicability of an end-to-end Transformer-based architecture in the context of drug discovery, capable of self-providing different levels of potential DTI and prediction understanding due to the nature of the attention blocks. The data and source code used in this study are available at: https://github.com/larngroup/DTITR.

Keywords: Attention; Binding affinity; Deep learning; Drug–target interaction; Transformer.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Drug Development
Drug Discovery / methods
Proteins* / chemistry
Software*

Substances

Proteins