Design of the LifeArc Index Set and Retrospective Review of Its Performance: A Collection for Sharing

SLAS Discov. 2019 Mar;24(3):332-345. doi: 10.1177/2472555218803696. Epub 2018 Oct 5.

Abstract

Building, curating, and maintaining a compound collection is an expensive operation, beyond the scope of most academic organizations. Here we describe the selection criteria used to compile the LifeArc diversity set from commercial suppliers and the process we undertook to generate our representative LifeArc index set. The aim was to avoid a "junk in, junk out" screen collection to increase chemical tractability going forward, while maximizing diversity. Using historical LifeArc screening data, we demonstrate that the index set was predictive of ligandability and that progressable hits could be identified by mining associated clusters within our larger diversity set. Indeed, a higher percentage of index-derived hit clusters were found to have been progressed into hit-to-lead programs, reflecting better drug-likeness. In practice, the library has been shared widely with academic groups and used routinely within LifeArc to assess the ligandability of novel targets. Its small size is well suited to meet the needs of medium-throughput screening in labs with either limited automation, limited precious or expensive reagents, or complex cellular assays. The strategy of screening a small set in combination with rapid hit analog follow-up has demonstrated the utility of finding active clusters for potential development against challenging targets.

Keywords: chemoinformatics; compound library design; high-throughput screening.

Publication types

  • Review

MeSH terms

  • Data Mining
  • Databases, Chemical
  • Drug Discovery / methods*
  • High-Throughput Screening Assays / methods*
  • Retrospective Studies