Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models

Julio Nogales-Bueno; Francisco José Rodríguez-Pulido; Berta Baca-Bocanegra; Dolores Pérez-Marin; Francisco José Heredia; Ana Garrido-Varo; José Miguel Hernández-Hierro

doi:10.3390/foods10020233

Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models

Foods. 2021 Jan 23;10(2):233. doi: 10.3390/foods10020233.

Authors

Julio Nogales-Bueno^{1

2}, Francisco José Rodríguez-Pulido¹, Berta Baca-Bocanegra¹, Dolores Pérez-Marin², Francisco José Heredia¹, Ana Garrido-Varo², José Miguel Hernández-Hierro¹

Affiliations

¹ Food Colour and Quality Laboratory, Área de Nutrición y Bromatología, Facultad de Farmacia, Universidad de Sevilla, 41012 Sevilla, Spain.
² Department of Animal Production, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain.

Abstract

Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spectrally representative samples within a large population set. In this study, principal component and hierarchical clustering analyses have been compared for their ability to provide different representative calibration sets. The calibration sets generated have been used to control the technological maturity of grapes and total phenolic compounds of grape skins in red and white cultivars. Finally, the accuracy and precision of the models obtained with these calibration sets resulted from the application of the selection algorithms studied have been compared with each other and with the whole set of samples using an external validation set. Most of the standard errors of prediction (SEP) in external validation obtained from the reduced data sets were not significantly different from those obtained using the whole data set. Moreover, sample subsets resulting from hierarchical clustering analysis appear to produce slightly better results.

Keywords: chemometrics; grape quality; hyperspectral imaging; near-infrared; sample selection.

Abstract

Grants and funding