A comprehensive comparative assessment of 3D molecular similarity tools in ligand-based virtual screening

Brief Bioinform. 2021 Nov 5;22(6):bbab231. doi: 10.1093/bib/bbab231.

Abstract

Three-dimensional (3D) molecular similarity, one major ligand-based virtual screening (VS) method, has been widely used in the drug discovery process. A variety of 3D molecular similarity tools have been developed in recent decades. In this study, we assessed a panel of 15 3D molecular similarity programs against the DUD-E and LIT-PCBA datasets, including commercial ROCS and Phase, in terms of screening power and scaffold-hopping power. The results revealed that (1) SHAFTS, LS-align, Phase Shape_Pharm and LIGSIFT showed the best VS capability in terms of screening power. Some 3D similarity tools available to academia can yield relatively better VS performance than commercial ROCS and Phase software. (2) Current 3D similarity VS tools exhibit a considerable ability to capture actives with new chemotypes in terms of scaffold hopping. (3) Multiple conformers relative to single conformations will generally improve VS performance for most 3D similarity tools, with marginal improvement observed in area under the receiving operator characteristic curve values, enrichment factor in the top 1% and hit rate in the top 1% values showed larger improvement. Moreover, redundancy and complementarity analyses of hit lists from different query seeds and different 3D similarity VS tools showed that the combination of different query seeds and/or different 3D similarity tools in VS campaigns retrieved more (and more diverse) active molecules. These findings provide useful information for guiding choices of the optimal 3D molecular similarity tools for VS practices and designing possible combination strategies to discover more diverse active compounds.

Keywords: 3D molecular similarity; query template; scaffold hopping; screening power; virtual screening.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Area Under Curve
  • Benchmarking
  • Databases, Pharmaceutical
  • Drug Design
  • Drug Discovery / methods*
  • Drug Evaluation, Preclinical / methods
  • Ligands
  • Models, Molecular*
  • Molecular Conformation*
  • Molecular Structure
  • ROC Curve
  • Software*
  • Web Browser

Substances

  • Ligands