Computationally predicting binding affinity in protein-ligand complexes: free energy-based simulations and machine learning-based scoring functions

Brief Bioinform. 2021 May 20;22(3):bbaa107. doi: 10.1093/bib/bbaa107.

Abstract

Accurately predicting protein-ligand binding affinities can substantially facilitate the drug discovery process, but it remains as a difficult problem. To tackle the challenge, many computational methods have been proposed. Among these methods, free energy-based simulations and machine learning-based scoring functions can potentially provide accurate predictions. In this paper, we review these two classes of methods, following a number of thermodynamic cycles for the free energy-based simulations and a feature-representation taxonomy for the machine learning-based scoring functions. More recent deep learning-based predictions, where a hierarchy of feature representations are generally extracted, are also reviewed. Strengths and weaknesses of the two classes of methods, coupled with future directions for improvements, are comparatively discussed.

Keywords: affinity prediction; deep learning; free energy-based simulation; machine learning; protein–ligand binding affinity; scoring function.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein*
  • Ligands
  • Machine Learning*
  • Molecular Docking Simulation*
  • Protein Binding
  • Proteins / chemistry*
  • Proteins / metabolism
  • Thermodynamics

Substances

  • Ligands
  • Proteins