Assessment of tautomer distribution using the condensed reaction graph approach

J Comput Aided Mol Des. 2018 Mar;32(3):401-414. doi: 10.1007/s10822-018-0101-6. Epub 2018 Jan 29.

Abstract

We report the first direct QSPR modeling of equilibrium constants of tautomeric transformations (logK T ) in different solvents and at different temperatures, which do not require intermediate assessment of acidity (basicity) constants for all tautomeric forms. The key step of the modeling consisted in the merging of two tautomers in one sole molecular graph ("condensed reaction graph") which enables to compute molecular descriptors characterizing entire equilibrium. The support vector regression method was used to build the models. The training set consisted of 785 transformations belonging to 11 types of tautomeric reactions with equilibrium constants measured in different solvents and at different temperatures. The models obtained perform well both in cross-validation (Q2 = 0.81 RMSE = 0.7 logK T units) and on two external test sets. Benchmarking studies demonstrate that our models outperform results obtained with DFT B3LYP/6-311 ++ G(d,p) and ChemAxon Tautomerizer applicable only in water at room temperature.

Keywords: Condensed graphs of reaction; QSPR; Support vector regression; Tautomerism.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation*
  • Isomerism
  • Molecular Structure
  • Quantitative Structure-Activity Relationship
  • Solvents / chemistry*
  • Temperature*
  • Thermodynamics
  • Water / chemistry

Substances

  • Solvents
  • Water