Scaffold-Based Analytics: Enabling Hit-to-Lead Decisions by Visualizing Chemical Series Linked across Large Datasets

J Chem Inf Model. 2019 Nov 25;59(11):4880-4892. doi: 10.1021/acs.jcim.9b00243. Epub 2019 Oct 29.

Abstract

We present a method for visualizing and navigating large screening datasets while also taking into account their activities and properties. Our approach is to annotate the data with all possible scaffolds contained within each molecule. We have developed a Spotfire visualization, coupled to a fuzzy clustering approach based on the scaffold decomposition of the screening deck, used to drive the hit triage process. Progression decisions can be made using aggregate scaffold parameters and data from multiple datasets merged at the scaffold level. This visualization reveals overlaps that help prioritize hits, highlight tractable series, and posit ways to combine aspects of multiple hits. The structure-activity relationship of a large and complex hit is automatically mapped onto all constituent scaffolds making it possible to navigate, via any shared scaffold, to all related hits. This scaffold "walking" helps address bias toward a handful of potent and ligand-efficient molecules at the expense of coverage of chemical space. We consider two scaffold generation methods and explored their similarities and differences both qualitatively and quantitatively. The workflow of a Spotfire visualization used in combination with fuzzy clustering and structure annotation provides an intuitive view of large and diverse screening datasets. This allows teams to effortlessly navigate between structurally related molecules and enriches the population of leads considered and progressed in a manner complementary to established approaches.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Datasets as Topic
  • Drug Discovery* / methods
  • Fuzzy Logic
  • Humans
  • Ligands
  • Small Molecule Libraries / chemistry*
  • Small Molecule Libraries / pharmacology

Substances

  • Ligands
  • Small Molecule Libraries