An inductive graph neural network model for compound-protein interaction prediction based on a homogeneous graph

Brief Bioinform. 2022 May 13;23(3):bbac073. doi: 10.1093/bib/bbac073.

Abstract

Identifying the potential compound-protein interactions (CPIs) plays an essential role in drug development. The computational approaches for CPI prediction can reduce time and costs of experimental methods and have benefited from the continuously improved graph representation learning. However, most of the network-based methods use heterogeneous graphs, which is challenging due to their complex structures and heterogeneous attributes. Therefore, in this work, we transformed the compound-protein heterogeneous graph to a homogeneous graph by integrating the ligand-based protein representations and overall similarity associations. We then proposed an Inductive Graph AggrEgator-based framework, named CPI-IGAE, for CPI prediction. CPI-IGAE learns the low-dimensional representations of compounds and proteins from the homogeneous graph in an end-to-end manner. The results show that CPI-IGAE performs better than some state-of-the-art methods. Further ablation study and visualization of embeddings reveal the advantages of the model architecture and its role in feature extraction, and some of the top ranked CPIs by CPI-IGAE have been validated by a review of recent literature. The data and source codes are available at https://github.com/wanxiaozhe/CPI-IGAE.

Keywords: compound–protein interaction prediction; end-to-end learning; homogeneous graph; inductive graph neural network.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Drug Development*
  • Neural Networks, Computer*
  • Protein Interaction Mapping
  • Protein Interaction Maps*
  • Proteins* / chemistry
  • Software

Substances

  • Proteins