Estimation of wheat protein content and wet gluten content based on fusion of hyperspectral and RGB sensors using machine learning algorithms

Food Chem. 2024 Aug 1:448:139103. doi: 10.1016/j.foodchem.2024.139103. Epub 2024 Mar 22.

Abstract

The protein content (PC) and wet gluten content (WGC) are crucial indicators determining the quality of wheat, playing a pivotal role in evaluating processing and baking performance. Original reflectance (OR), wavelet feature (WF), and color index (CI) were extracted from hyperspectral and RGB sensors. Combining Pearson-competitive adaptive reweighted sampling (CARs)-variance inflation factor (VIF) with four machine learning (ML) algorithms were used to model accuracy of PC and WGC. As a result, three CIs, six ORs, and twelve WFs were selected for PC and WGC datasets. For single-modal data, the back-propagation neural network exhibited superior accuracy, with estimation accuracies (WF > OR > CI). For multi-modal data, the random forest regression paired with OR + WF + CI showed the highest validation accuracy. Utilizing the Gini impurity, WF outweighed OR and CI in the PC and WGC models. The amalgamation of MLs with multimodal data harnessed the synergies among various remote sensing sources, substantially augmenting model precision and stability.

Keywords: Machine learning; Multi-modal data; Pearson-CARs-VIF; Protein content; Wet gluten content; Wheat.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms*
  • Glutens* / analysis
  • Glutens* / chemistry
  • Machine Learning*
  • Plant Proteins* / analysis
  • Plant Proteins* / chemistry
  • Triticum* / chemistry

Substances

  • Glutens
  • Plant Proteins