Cross-Classified Multilevel Modelling of the Effectiveness of Similarity-Based Virtual Screening

ChemMedChem. 2018 Mar 20;13(6):582-587. doi: 10.1002/cmdc.201700487. Epub 2017 Nov 6.

Abstract

The screening effectiveness of a chemical similarity search depends on a range of factors, including the bioactivity of interest, the types of similarity coefficient and fingerprint that comprise the similarity measure, and the nature of the reference structure that is being searched against a database. This study introduces the use of cross-classified multilevel modelling as a way to investigate the relative importance of these four factors when carrying out similarity searches on the ChEMBL database. Two principal conclusions can be drawn from the analyses: that the fingerprint plays a more important role than the similarity coefficient in determining the effectiveness of a similarity search, and that comparative studies of similarity measures should involve many more reference structures than has been the case in much previous work.

Keywords: chemical fingerprints; cross-classified multilevel modelling; similarity coefficients; similarity searching; virtual screening.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Chemical*
  • Drug Discovery
  • Drug Evaluation, Preclinical*
  • Ligands
  • Models, Molecular
  • Pharmaceutical Preparations / chemistry*

Substances

  • Ligands
  • Pharmaceutical Preparations