AK-Score: Accurate Protein-Ligand Binding Affinity Prediction Using an Ensemble of 3D-Convolutional Neural Networks

Int J Mol Sci. 2020 Nov 10;21(22):8424. doi: 10.3390/ijms21228424.

Abstract

Accurate prediction of the binding affinity of a protein-ligand complex is essential for efficient and successful rational drug design. Therefore, many binding affinity prediction methods have been developed. In recent years, since deep learning technology has become powerful, it is also implemented to predict affinity. In this work, a new neural network model that predicts the binding affinity of a protein-ligand complex structure is developed. Our model predicts the binding affinity of a complex using the ensemble of multiple independently trained networks that consist of multiple channels of 3-D convolutional neural network layers. Our model was trained using the 3772 protein-ligand complexes from the refined set of the PDBbind-2016 database and tested using the core set of 285 complexes. The benchmark results show that the Pearson correlation coefficient between the predicted binding affinities by our model and the experimental data is 0.827, which is higher than the state-of-the-art binding affinity prediction scoring functions. Additionally, our method ranks the relative binding affinities of possible multiple binders of a protein quite accurately, comparable to the other scoring functions. Last, we measured which structural information is critical for predicting binding affinity and found that the complementarity between the protein and ligand is most important.

Keywords: ResNext; binding affinity prediction; convolutional neural network; deep learning; docking score; protein-ligand binding affinity.

MeSH terms

  • Computer-Aided Design
  • Databases, Protein
  • Deep Learning
  • Drug Design
  • Drug Discovery
  • Humans
  • Ligands
  • Molecular Docking Simulation
  • Molecular Dynamics Simulation
  • Neural Networks, Computer*
  • Protein Binding*
  • Proteins / chemistry*
  • Proteins / metabolism*
  • User-Computer Interface

Substances

  • Ligands
  • Proteins