Region of interest selection in heterogeneous digital image: Wine age prediction by comprehensive two-dimensional gas chromatography

Curr Res Food Sci. 2024 Mar 29:8:100725. doi: 10.1016/j.crfs.2024.100725. eCollection 2024.

Abstract

This study integrates genetic algorithm (GA) with partial least squares regression (PLSR) and various variable selection methods to identify impactful regions of interest (ROI) in heterogeneous 2D chromatogram images for predicting wine age. As wine quality and aroma evolve over time, transitioning from youthful fruitiness to mature, complex flavors, which leads to alterations in the composition of essential aroma-contributing compounds. Chromatograms are segmented into subimages, and the GA-PLSR algorithm optimizes combinations based on grayscale, red-green-blue (RGB), and hue-saturation-value (HSV) histograms. The selected subimage histograms are further refined through interval selection, highlighting the compounds with the most significant influence on wine aging. Experimental validation involving 38 wine samples demonstrates the effectiveness of this approach. Cross-validation reduces the PLS model error from 2.8 to 2.4 years within a 10 × 10 subset, and during prediction, the error decreases from 2.5 to 2.3 years. The study presents a novel approach utilizing the selection of ROI for efficient processing of 2D chromatograms focusing on predicting wine age.

Keywords: Comprehensive two-dimensional gas chromatography; Region of interest; Volatile organic compounds; Wine age; Wine analysis.