Compound-protein interaction prediction by deep learning: Databases, descriptors and models

Drug Discov Today. 2022 May;27(5):1350-1366. doi: 10.1016/j.drudis.2022.02.023. Epub 2022 Mar 3.

Abstract

The screening of compound-protein interactions (CPIs) is one of the most crucial steps in finding hit and lead compounds. Deep learning (DL) methods for CPI prediction can address intrinsic limitations of traditional HTS and virtual screening with the advantage of low cost and high efficiency. This review provides a comprehensive survey of DL-based CPI prediction. It first summarizes popular databases of small-molecule compounds, proteins and binding complexes. Then, it outlines classical representations of compounds and proteins in turn. After that, this review briefly introduces state-of-the-art DL-based models in terms of design paradigms and investigates their prediction performance. Finally, it indicates current challenges and trends toward better CPI prediction and sketches out crucial approaches toward practical applications.

Keywords: Compound–protein interaction; Deep learning; Drug discovery; Embedding; Representation.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Factual
  • Deep Learning*
  • Drug Discovery / methods
  • Proteins / metabolism

Substances

  • Proteins