Machine learning for classifying and predicting grape maturity indices using absorbance and fluorescence spectra

Food Chem. 2023 Mar 1:403:134321. doi: 10.1016/j.foodchem.2022.134321. Epub 2022 Sep 20.

Abstract

Absorbance-transmission and fluorescence excitation-emission matrix (A-TEEM) spectroscopy was investigated as a rapid method for predicting maturity indices using Cabernet Sauvignon grapes produced under four viticulture treatments during two growing seasons. Machine learning models were developed with fused spectral data to predict 3-isobutyl-2-methoxypyrazine (IBMP), pH, total tannins (Tannin), total soluble solids (TSS), and malic and tartaric acids based on the results from traditional analysis methods. Extreme gradient boosting (XGB) regression yielded R2 values of 0.92-0.96 for IBMP, malic acid, pH, and TSS for externally validated (Test) models, with partial least squares regression being superior for TSS prediction (R2 = 0.97). R2 values of 0.64-0.81 were achieved with either approach for tartaric acid and Tannin predictions. Classification of grape maturity, defined by quantile ranges for red colour, IBMP, malic acid, and TSS, was investigated using XGB discriminant analysis, providing an average of 78 % correctly classified samples for the Test model.

Keywords: A-TEEM; Chemometrics; Data fusion; Discriminant analysis; Regression; XGBoost.

MeSH terms

  • Machine Learning
  • Tannins / analysis
  • Vitis* / chemistry
  • Wine* / analysis

Substances

  • malic acid
  • Tannins