Exploring the chemical space of protein-protein interaction inhibitors through machine learning

Sci Rep. 2021 Jun 28;11(1):13369. doi: 10.1038/s41598-021-92825-5.

Abstract

Although protein-protein interactions (PPIs) have emerged as the basis of potential new therapeutic approaches, targeting intracellular PPIs with small-molecule inhibitors is conventionally considered highly challenging. Driven by increasing research efforts, success rates have increased significantly in recent years. In this study, we analyze the physicochemical properties of 9351 non-redundant inhibitors present in the iPPI-DB and TIMBAL databases to define a computational model for active compounds acting against PPI targets. Principal component analysis (PCA) and k-means clustering were used to identify plausible PPI targets in regions of interest in the active group in the chemical space between active and inactive iPPI compounds. Notably, the uniquely defined active group exhibited distinct differences in activity compared with other active compounds. These results demonstrate that active compounds with regions of interest in the chemical space may be expected to provide insights into potential PPI inhibitors for particular protein targets.
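
As a rough illustration of the kind of analysis described above, the following minimal sketch standardizes a small descriptor table, projects it onto two principal components, and clusters the projected points with k-means. It is not the authors' pipeline: the abstract does not state which descriptors, scaling, number of clusters, or software were used. The descriptor columns (molecular weight, logP, hydrogen-bond acceptors), the twelve toy rows, and the function names `standardize`, `pca_scores`, and `kmeans` are all assumptions made for this example. Power iteration and farthest-first initialization are simplifications chosen to keep the code dependency-free and deterministic.

```python
from statistics import mean, pstdev

# Hypothetical toy descriptors: (molecular weight, logP, H-bond acceptors).
# These values are invented for illustration and are not taken from iPPI-DB or TIMBAL.
descriptors = [
    (520, 4.8, 8), (560, 5.2, 9), (540, 5.0, 7),
    (580, 4.6, 8), (530, 5.4, 9), (550, 4.9, 8),
    (340, 2.1, 4), (360, 1.8, 5), (330, 2.4, 3),
    (370, 2.0, 4), (350, 1.6, 5), (345, 2.2, 4),
]

def standardize(rows):
    """Z-score each column so descriptors on different scales are comparable."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) or 1.0 for c in cols]  # guard against constant columns
    return [[(x - m) / s for x, m, s in zip(r, mus, sds)] for r in rows]

def matvec(m, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def pca_scores(rows, n_components=2, iters=500):
    """Project standardized rows onto the leading principal components.

    Components are found by power iteration on the covariance matrix,
    deflating the matrix after each component is extracted.
    """
    z = standardize(rows)
    n, d = len(z), len(z[0])
    cov = [[sum(r[i] * r[j] for r in z) / (n - 1) for j in range(d)]
           for i in range(d)]
    components = []
    for _ in range(n_components):
        v = [float(i + 1) for i in range(d)]  # fixed, non-symmetric start vector
        for _ in range(iters):
            w = matvec(cov, v)
            norm = sum(x * x for x in w) ** 0.5
            if norm < 1e-12:  # no variance left to explain
                break
            v = [x / norm for x in w]
        lam = sum(a * b for a, b in zip(v, matvec(cov, v)))  # eigenvalue estimate
        components.append(v)
        # Deflation: remove the variance explained by this component.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return [[sum(a * b for a, b in zip(r, c)) for c in components] for r in z]

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, iters=100):
    """Lloyd's k-means with deterministic farthest-first initialization."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        farthest = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(farthest))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [mean(col) for col in zip(*members)]
    return labels, centroids

scores = pca_scores(descriptors, n_components=2)
labels, centroids = kmeans(scores, k=2)
for row, lab in zip(descriptors, labels):
    print(row, "-> cluster", lab)
```

On real data, a numerical library would normally replace the hand-written PCA and k-means, and the number of clusters would have to be chosen and validated rather than fixed at two as it is here.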

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Drug Discovery / methods
  • Humans
  • Machine Learning
  • Principal Component Analysis / methods
  • Protein Interaction Mapping / methods
  • Proteins / chemistry*
  • Small Molecule Libraries / chemistry*

Substances

  • Proteins
  • Small Molecule Libraries