Ligand Binding Prediction Using Protein Structure Graphs and Residual Graph Attention Networks

Molecules. 2022 Aug 11;27(16):5114. doi: 10.3390/molecules27165114.

Abstract

Computational prediction of ligand-target interactions is a crucial part of modern drug discovery as it helps to bypass high costs and labor demands of in vitro and in vivo screening. As the wealth of bioactivity data accumulates, it provides opportunities for the development of deep learning (DL) models with increasing predictive powers. Conventionally, such models were either limited to the use of very simplified representations of proteins or ineffective voxelization of their 3D structures. Herein, we present the development of the PSG-BAR (Protein Structure Graph-Binding Affinity Regression) approach that utilizes 3D structural information of the proteins along with 2D graph representations of ligands. The method also introduces attention scores to selectively weight protein regions that are most important for ligand binding. Results: The developed approach demonstrates the state-of-the-art performance on several binding affinity benchmarking datasets. The attention-based pooling of protein graphs enables identification of surface residues as critical residues for protein-ligand binding. Finally, we validate our model predictions against an experimental assay on a viral main protease (Mpro)-the hallmark target of SARS-CoV-2 coronavirus.

Keywords: SARS-CoV-2; computer-aided drug discovery; deep learning; drug–target interaction; graph attention network; protein–ligand binding; virtual screening.

MeSH terms

  • COVID-19*
  • Humans
  • Ligands
  • Protein Binding
  • Proteins / chemistry
  • SARS-CoV-2*

Substances

  • Ligands
  • Proteins