Machine learning modeling and additive explanation techniques for glutathione production from multiple experimental growth conditions of Saccharomyces cerevisiae

Int J Biol Macromol. 2024 Mar;262(Pt 2):130035. doi: 10.1016/j.ijbiomac.2024.130035. Epub 2024 Feb 7.

Abstract

Glutathione (GSH) production is of great industrial interest due to its essential properties. This study aimed to use machine learning (ML) methods to model GSHproduction under different growth conditions of Saccharomyces cerevisiae, namely cultivation time, culture volume, pressure, and magnetic field application. Different ML and regression models were evaluated for their statistics to select the most robust model. Results showed that eXtreme Gradient Boosting (XGB) was the best predictive performance model. From the best model, additive explanation techniques were used to identify the feature importance of process. According to variable analysis, the best conditions to obtain the highest GSH concentrations would be cultivation times of 72-96 h, low magnetic field intensity (3.02 mT), low pressure (0.5 kgf.cm-2), and high culture volume (3.5 L). XGB use and additive explanation techniques proved promising for determining process optimization conditions and selecting the essential process variables.

Keywords: Feature interaction; Machine learning; Partial dependence plot; SHAP value; Stress conditions.

MeSH terms

  • Glutathione*
  • Industry
  • Light
  • Machine Learning
  • Saccharomyces cerevisiae*

Substances

  • Glutathione