Fast Rescoring Protocols to Improve the Performance of Structure-Based Virtual Screening Performed on Protein-Protein Interfaces

J Chem Inf Model. 2020 Aug 24;60(8):3910-3934. doi: 10.1021/acs.jcim.0c00545. Epub 2020 Aug 11.

Abstract

Protein-protein interactions (PPIs) are attractive targets for drug design because of their essential role in numerous cellular processes and disease pathways. However, in general, PPIs display exposed binding pockets at the interface, and as such, have been largely unexploited for therapeutic interventions with low-molecular weight compounds. Here, we used docking and various rescoring strategies in an attempt to recover PPI inhibitors from a set of active and inactive molecules for 11 targets collected in ChEMBL and PubChem. Our focus is on the screening power of the various developed protocols and on using fast approaches so as to be able to apply such a strategy to the screening of ultralarge libraries in the future. First, we docked compounds into each target using the fast "pscreen" mode of the structure-based virtual screening (VS) package Surflex. Subsequently, the docking poses were postprocessed to derive a set of 3D topological descriptors: (i) shape similarity and (ii) interaction fingerprint similarity with a co-crystallized inhibitor, (iii) solvent-accessible surface area, and (iv) extent of deviation from the geometric center of a reference inhibitor. The derivatized descriptors, together with descriptor-scaled scoring functions, were utilized to investigate possible impacts on VS performance metrics. Moreover, four standalone scoring functions, RF-Score-VS (machine-learning), DLIGAND2 (knowledge-based), Vinardo (empirical), and X-SCORE (empirical), were employed to rescore the PPI compounds. Collectively, the results indicate that the topological scoring algorithms could be valuable both at a global level, with up to 79% increase in areas under the receiver operating characteristic curve for some targets, and in early stages, with up to a 4-fold increase in enrichment factors at 1% of the screened collections. Outstandingly, DLIGAND2 emerged as the best scoring function on this data set, outperforming all rescoring techniques in terms of VS metrics. The described methodology could help in the rational design of small-molecule PPI inhibitors and has direct applications in many therapeutic areas, including cancer, CNS, and infectious diseases such as COVID-19.

MeSH terms

  • Algorithms
  • Betacoronavirus / drug effects
  • Betacoronavirus / metabolism
  • COVID-19
  • Coronavirus Infections / drug therapy
  • Coronavirus Infections / metabolism
  • Databases, Protein
  • Drug Design*
  • Drug Discovery*
  • Humans
  • Ligands
  • Machine Learning
  • Molecular Docking Simulation
  • Molecular Targeted Therapy
  • Pandemics
  • Pneumonia, Viral / drug therapy
  • Pneumonia, Viral / metabolism
  • Protein Interaction Maps / drug effects*
  • Proteins / chemistry
  • Proteins / metabolism
  • SARS-CoV-2
  • Small Molecule Libraries / chemistry
  • Small Molecule Libraries / pharmacology*

Substances

  • Ligands
  • Proteins
  • Small Molecule Libraries