CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein-Protein Interaction Modulators

ChemMedChem. 2018 Mar 20;13(6):532-539. doi: 10.1002/cmdc.201700689. Epub 2018 Feb 20.

Abstract

A common issue during drug design and development is the discovery of novel scaffolds for protein targets. On the one hand the chemical space of purchasable compounds is rather limited; on the other hand artificially generated molecules suffer from a grave lack of accessibility in practice. Therefore, we generated a novel virtual library of small molecules which are synthesizable from purchasable educts, called CHIPMUNK (CHemically feasible In silico Public Molecular UNiverse Knowledge base). Altogether, CHIPMUNK covers over 95 million compounds and encompasses regions of the chemical space that are not covered by existing databases. The coverage of CHIPMUNK exceeds the chemical space spanned by the Lipinski rule of five to foster the exploration of novel and difficult target classes. The analysis of the generated property space reveals that CHIPMUNK is well suited for the design of protein-protein interaction inhibitors (PPIIs). Furthermore, a recently developed structural clustering algorithm (StruClus) for big data was used to partition the sub-libraries into meaningful subsets and assist scientists to process the large amount of data. These clustered subsets also contain the target space based on ChEMBL data which was included during clustering.

Keywords: beyond rule of five; heterocycles; in silico reactions; multiple component reactions; protein-protein interaction inhibitors; virtual libraries.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chemistry, Pharmaceutical
  • Cluster Analysis
  • Drug Design
  • Protein Binding / drug effects
  • Proteins / antagonists & inhibitors
  • Proteins / chemistry*
  • Small Molecule Libraries / chemical synthesis
  • Small Molecule Libraries / chemistry*
  • Small Molecule Libraries / pharmacology*

Substances

  • Proteins
  • Small Molecule Libraries