A consistent dataset of kinetic solubilities for early-phase drug discovery

ChemMedChem. 2009 Sep;4(9):1529-36. doi: 10.1002/cmdc.200900205.

Abstract

Herein, we describe a new dataset of kinetic aqueous solubilities determined by nephelometry for 711 druglike compounds. The solubilities are reported in twelve classes ranging from <2 microg mL(-1) to >250 microg mL(-1). The measurements were designed to provide the appropriate data for applications in the early phases of drug discovery. Three class classification models (insoluble, moderately soluble, soluble) were built using the random forest algorithm and their performance for this dataset was analyzed.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Drug Discovery*
  • Nephelometry and Turbidimetry
  • Organic Chemicals / chemistry
  • Organic Chemicals / classification
  • Solubility

Substances

  • Organic Chemicals