Computational methods and tools for binding site recognition between proteins and small molecules: from classical geometrical approaches to modern machine learning strategies

J Comput Aided Mol Des. 2019 Oct;33(10):887-903. doi: 10.1007/s10822-019-00235-7. Epub 2019 Oct 18.

Abstract

In the current "genomic era" the number of identified genes is growing exponentially. However, the biological function of a large number of the corresponding proteins is still unknown. Recognition of small molecule ligands (e.g., substrates, inhibitors, allosteric regulators, etc.) is pivotal for protein functions in the vast majority of the cases and knowledge of the region where these processes take place is essential for protein function prediction and drug design. In this regard, computational methods represent essential tools to tackle this problem. A significant number of software tools have been developed in the last few years which exploit either protein sequence information, structure information or both. This review describes the most recent developments in protein function recognition and binding site prediction, in terms of both freely-available and commercial solutions and tools, detailing the main characteristics of the considered tools and providing a comparative analysis of their performance.

Keywords: Binding region; Binding site recognition; Protein–ligand interactions; Software.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Artificial Intelligence
  • Computational Biology / methods*
  • Drug Design*
  • Humans
  • Ligands
  • Machine Learning*
  • Models, Molecular
  • Protein Binding
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Sequence Homology
  • Small Molecule Libraries / chemistry*
  • Small Molecule Libraries / metabolism*
  • Software

Substances

  • Ligands
  • Proteins
  • Small Molecule Libraries