pROC-Chemotype Plots Enhance the Interpretability of Benchmarking Results in Structure-Based Virtual Screening

J Chem Inf Model. 2015 Nov 23;55(11):2297-307. doi: 10.1021/acs.jcim.5b00475. Epub 2015 Oct 21.

Abstract

Recently, we have reported a systematic comparison of molecular preparation protocols (using MOE or Maestro) in combination with two docking tools (GOLD or Glide), employing our DEKOIS 2.0 benchmark sets. Herein, we demonstrate how comparable settings of data preparation protocols can affect the profile and AUC of pROC curves based on variations in chemotype enrichment. We show how the recognition of different classes of chemotypes can affect the docking performance, particularly in the early enrichment, and monitor changes in this recognition behavior based on score normalization and rescoring strategies. For this, we have developed "pROC-Chemotype", which is an automated protocol that matches and visualizes ligand chemotype information together with potency classes in the pROC profiles obtained by docking. This tool enhances the understanding of the influence of chemotype recognition in early enrichment, but also reveals trends of impaired recognition of chemotype classes at the end of the score-ordered rank. Identifying such issues helps to devise score-normalization strategies to overcome this potential bias in an intuitive manner. Furthermore, strong perturbations in chemotype ranking between different methods can help to identify the underlying reasons (e.g., changes in the protonation/tautomerization state). It also assists in the selection of appropriate scoring functions that are capable to retrieve more potent and diverse hits. In summary, we demonstrate how this new tool can be utilized to identify and highlight chemotype-specific behavior, e.g., in dataset preparation. This can help to overcome some chemistry-related bias in virtual screening campaigns. pROC-Chemotype is made freely available at www.dekois.com.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Area Under Curve
  • Drug Design*
  • Folic Acid Antagonists / chemistry
  • Folic Acid Antagonists / pharmacology*
  • Humans
  • Ligands
  • Molecular Docking Simulation*
  • Protein Binding
  • Quinazolines / chemistry
  • Quinazolines / pharmacology
  • ROC Curve
  • Tetrahydrofolate Dehydrogenase / metabolism*

Substances

  • Folic Acid Antagonists
  • Ligands
  • Quinazolines
  • Tetrahydrofolate Dehydrogenase