Multi-scaled self-attention for drug-target interaction prediction based on multi-granularity representation

BMC Bioinformatics. 2022 Aug 3;23(1):314. doi: 10.1186/s12859-022-04857-x.

Abstract

Background: Drug-target interaction (DTI) prediction plays a crucial role in drug discovery. Although the advanced deep learning has shown promising results in predicting DTIs, it still needs improvements in two aspects: (1) encoding method, in which the existing encoding method, character encoding, overlooks chemical textual information of atoms with multiple characters and chemical functional groups; as well as (2) the architecture of deep model, which should focus on multiple chemical patterns in drug and target representations.

Results: In this paper, we propose a multi-granularity multi-scaled self-attention (SAN) model by alleviating the above problems. Specifically, in process of encoding, we investigate a segmentation method for drug and protein sequences and then label the segmented groups as the multi-granularity representations. Moreover, in order to enhance the various local patterns in these multi-granularity representations, a multi-scaled SAN is built and exploited to generate deep representations of drugs and targets. Finally, our proposed model predicts DTIs based on the fusion of these deep representations. Our proposed model is evaluated on two benchmark datasets, KIBA and Davis. The experimental results reveal that our proposed model yields better prediction accuracy than strong baseline models.

Conclusion: Our proposed multi-granularity encoding method and multi-scaled SAN model improve DTI prediction by encoding the chemical textual information of drugs and targets and extracting their various local patterns, respectively.

Keywords: Deep learning; Drug–target interaction; Representations learning; Self-attention networks.

MeSH terms

  • Amino Acid Sequence
  • Attention
  • Drug Development*
  • Drug Discovery / methods
  • Proteins* / metabolism

Substances

  • Proteins