Factor analysis of error in oxidation potential calculation: A machine learning study

J Comput Chem. 2022 Aug 15;43(22):1504-1512. doi: 10.1002/jcc.26953. Epub 2022 Jun 28.

Abstract

The conductor-like polarizable continuum model (C-PCM), which is a low-cost solvation model, cannot treat characteristic interactions between the solvent and substructure(s) of the solute. Moreover, the error in a charged system is significant. Using machine learning, we clarified that the systematic error of the oxidation potential calculated by the G3B3/C-PCM was correlated with the molecular size of a solute. The G3B3/C-PCM overestimated the Gibbs oxidation energy by averaging 6.94 kcal/mol. According to the performance of related methods reported in previous studies, this error is mainly due to the solvation energy of the charged solute. Additionally, we succeeded in reducing the error to 2.27 kcal/mol (32%)-3.2 kcal/mol (40%) by correction based on the substructure information of the solute. To modify the C-PCM, effects that correlate with the molecular size of the solute in the charged system should be incorporated.

Keywords: B3LYP; C-PCM; G3B3; machine learning; redox potential.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Factor Analysis, Statistical
  • Machine Learning*
  • Models, Molecular
  • Solutions
  • Solvents / chemistry
  • Thermodynamics

Substances

  • Solutions
  • Solvents