TransDTI: Transformer-Based Language Models for Estimating DTIs and Building a Drug Recommendation Workflow

ACS Omega. 2022 Jan 12;7(3):2706-2717. doi: 10.1021/acsomega.1c05203. eCollection 2022 Jan 25.

Abstract

The identification of novel drug-target interactions is a labor-intensive and low-throughput process. In silico alternatives have proved to be of immense importance in assisting the drug discovery process. Here, we present TransDTI, a multiclass classification and regression workflow employing transformer-based language models to segregate interactions between drug-target pairs as active, inactive, and intermediate. The models were trained on large-scale drug-target interaction (DTI) data sets and showed an improvement over baseline methods in terms of the area under the receiver operating characteristic curve (auROC), the area under the precision-recall curve (auPR), the Matthews correlation coefficient (MCC), and R2. The results showed that transformer-based language models effectively predict novel drug-target interactions from sequence data. The proposed models significantly outperformed existing methods such as DeepConvDTI, DeepDTA, and DeepDTI on a test data set. Further, the novel interactions predicted by TransDTI were supported by molecular docking and simulation analysis, in which the predicted ligands showed interaction potential for MAP2K and transforming growth factor-β (TGFβ) similar to or better than that of their known inhibitors. The proposed approach can have a significant impact on the development of personalized therapy and clinical decision making.
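The workflow summarized above, embedding a drug and a target with transformer-based language models and then classifying the pair as active, inactive, or intermediate, can be sketched minimally as follows. The embedding dimensions, the linear-softmax classifier head, and all names here are illustrative assumptions standing in for the paper's actual architecture, in which pretrained transformers would produce the embeddings from SMILES strings and amino-acid sequences.

```python
import math
import random

# Illustrative class labels from the abstract's multiclass setup
CLASSES = ["active", "inactive", "intermediate"]

def softmax(scores):
    """Numerically stable softmax over a list of raw class scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def classify_pair(drug_emb, target_emb, weights, biases):
    """Score a drug-target pair from concatenated embeddings.

    drug_emb / target_emb: fixed-length vectors, stand-ins for the
    outputs of pretrained transformer encoders over a drug's SMILES
    string and a target's amino-acid sequence (assumption, not the
    paper's exact pipeline).
    weights: one weight vector per class; biases: one bias per class.
    """
    x = list(drug_emb) + list(target_emb)  # concatenate modalities
    scores = [sum(w_i * x_i for w_i, x_i in zip(w, x)) + b
              for w, b in zip(weights, biases)]
    probs = softmax(scores)
    best = max(range(len(CLASSES)), key=lambda i: probs[i])
    return CLASSES[best], probs

# Toy usage with random stand-in embeddings and untrained weights
random.seed(0)
dim = 4  # per-modality embedding size (illustrative only)
drug = [random.gauss(0, 1) for _ in range(dim)]
target = [random.gauss(0, 1) for _ in range(dim)]
weights = [[random.gauss(0, 1) for _ in range(2 * dim)] for _ in CLASSES]
biases = [0.0] * len(CLASSES)
label, probs = classify_pair(drug, target, weights, biases)
```

In the full workflow the weights would be learned from the large-scale DTI training sets, and the regression branch described in the abstract would replace the softmax head with a scalar affinity output.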