MulinforCPI: enhancing precision of compound-protein interaction prediction through novel perspectives on multi-level information integration

Brief Bioinform. 2023 Nov 22;25(1):bbad484. doi: 10.1093/bib/bbad484.

Abstract

Forecasting the interaction between compounds and proteins is crucial for discovering new drugs. However, previous sequence-based studies have not utilized three-dimensional (3D) information on compounds and proteins, such as atom coordinates and distance matrices, to predict binding affinity. Furthermore, numerous widely adopted computational techniques have relied on sequences of amino acid characters for protein representations. This approach may constrain the model's ability to capture meaningful biochemical features, impeding a more comprehensive understanding of the underlying proteins. Here, we propose a two-step deep learning strategy named MulinforCPI that incorporates transfer learning techniques with multi-level resolution features to overcome these limitations. Our approach leverages 3D information from both proteins and compounds and acquires a profound understanding of the atomic-level features of proteins. Besides, our research highlights the divide between first-principle and data-driven methods, offering new research prospects for compound-protein interaction tasks. We applied the proposed method to six datasets: Davis, Metz, KIBA, CASF-2016, DUD-E and BindingDB, to evaluate the effectiveness of our approach.

Keywords: 3D geometric information; compound–protein interaction; cross-attention; multi-level information; transfer learning.

MeSH terms

  • Amino Acids*
  • Protein Binding
  • Protein Conformation
  • Protein Interaction Mapping*

Substances

  • Amino Acids