Estimation of the size of drug-like chemical space based on GDB-17 data

J Comput Aided Mol Des. 2013 Aug;27(8):675-9. doi: 10.1007/s10822-013-9672-4. Epub 2013 Aug 21.

Abstract

The goal of this paper is to estimate the number of realistic drug-like molecules which could ever be synthesized. Unlike previous studies based on exhaustive enumeration of molecular graphs or on combinatorial enumeration preselected fragments, we used results of constrained graphs enumeration by Reymond to establish a correlation between the number of generated structures (M) and the number of heavy atoms (N): logM = 0.584 × N × logN + 0.356. The number of atoms limiting drug-like chemical space of molecules which follow Lipinsky's rules (N = 36) has been obtained from the analysis of the PubChem database. This results in M ≈ 10³³ which is in between the numbers estimated by Ertl (10²³) and by Bohacek (10⁶⁰).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Databases, Pharmaceutical*
  • Molecular Structure
  • Pharmaceutical Preparations / chemistry*

Substances

  • Pharmaceutical Preparations