Predicting Melting Points of Organic Molecules: Applications to Aqueous Solubility Prediction Using the General Solubility Equation

Mol Inform. 2015 Nov;34(11-12):715-24. doi: 10.1002/minf.201500052. Epub 2015 Jul 20.

Abstract

In this work we make predictions of several important molecular properties of academic and industrial importance to seek answers to two questions: 1) Can we apply efficient machine learning techniques, using inexpensive descriptors, to predict melting points to a reasonable level of accuracy? 2) Can values of this level of accuracy be usefully applied to predicting aqueous solubility? We present predictions of melting points made by several novel machine learning models, previously applied to solubility prediction. Additionally, we make predictions of solubility via the General Solubility Equation (GSE) and monitor the impact of varying the logP prediction model (AlogP and XlogP) on the GSE. We note that the machine learning models presented, using a modest number of 2D descriptors, can make melting point predictions in line with the current state of the art prediction methods (RMSE≥40 °C). We also find that predicted melting points, with an RMSE of tens of degrees Celsius, can be usefully applied to the GSE to yield accurate solubility predictions (log10 S RMSE<1) over a small dataset of drug-like molecules.

Keywords: Machine learning; Melting points; Pharmaceuticals; QSPR; Solubility.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Chemical*
  • Freezing*
  • Machine Learning*
  • Models, Chemical*
  • Solubility