Prediction of Protein Concentration in Pea (Pisum sativum L.) Using Near-Infrared Spectroscopy (NIRS) Systems

Foods. 2022 Nov 18;11(22):3701. doi: 10.3390/foods11223701.

Abstract

Breeding for increased protein concentration is a priority in field peas. A quick, accurate, and non-destructive protein quantification method is critical for screening breeding materials, and near-infrared spectroscopy (NIRS) can provide one. Partial least squares regression (PLSR) models to predict protein concentration were developed and compared for the DA7250 and FT9700 NIRS systems. The reference protein data were accurate and exhibited a wide range of variation (15.3–29.8%). Spectral pre-treatments had no clear advantage over analyses based on raw spectral data. Because of the large number of samples used in this study, prediction accuracies remained similar across calibration sizes. The final PLSR models for the DA7250 and FT9700 systems required 10 and 13 latent variables, respectively, and performed well and comparably (R² = 0.72, RMSE = 1.22, and bias = 0.003 for DA7250; R² = 0.79, RMSE = 1.23, and bias = 0.055 for FT9700). Considering three groupings for protein concentration (Low: <20%; Medium: ≥20% and ≤25%; High: >25%), none of the entries changed from low to high or vice versa between the observed and predicted values for the DA7250 system. For the FT9700 system, only a single entry in the calibration set moved from the low category in the observed data to the high category in the predicted data. Although the FT9700 system outperformed the DA7250 system by a small margin, both systems had the potential to predict protein concentration in pea seeds for breeding purposes. Wavelengths between 950 nm and 1650 nm accounted for most of the variation in pea protein concentration.

Keywords: DA7250 system; FT9700 system; NIRS; PLSR; calibration size; Dumas method; protein prediction; spectral pre-treatment.
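
The evaluation summarized in the abstract (R², RMSE, bias, and the Low/Medium/High protein groupings) can be illustrated with a short Python sketch. This is not the authors' code: the function names and the sample values are hypothetical. The abstract does not define its statistics, so the sketch assumes R² is the coefficient of determination (1 − SS_res/SS_tot) and bias is the mean of predicted minus observed.

```python
import math


def protein_class(pct):
    """Assign the grouping used in the abstract: Low <20%, Medium 20-25%, High >25%."""
    if pct < 20:
        return "Low"
    if pct <= 25:
        return "Medium"
    return "High"


def evaluate(observed, predicted):
    """Return R2, RMSE, and bias for paired observed/predicted protein values.

    Assumptions (not stated in the abstract): R2 = 1 - SS_res / SS_tot,
    bias = mean(predicted - observed).
    """
    n = len(observed)
    residuals = [p - o for o, p in zip(observed, predicted)]
    ss_res = sum(r * r for r in residuals)
    mean_obs = sum(observed) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return {
        "r2": 1 - ss_res / ss_tot,
        "rmse": math.sqrt(ss_res / n),
        "bias": sum(residuals) / n,
    }


def extreme_swaps(observed, predicted):
    """Count entries whose class flips between Low and High (in either direction)."""
    count = 0
    for o, p in zip(observed, predicted):
        if {protein_class(o), protein_class(p)} == {"Low", "High"}:
            count += 1
    return count


# Hypothetical protein percentages, for illustration only.
observed = [18.0, 22.0, 26.0, 24.0]
predicted = [19.0, 21.0, 27.0, 23.0]

stats = evaluate(observed, predicted)
swaps = extreme_swaps(observed, predicted)
print(stats, swaps)
```

Counting only Low-to-High and High-to-Low flips mirrors the abstract's criterion: a breeder can tolerate an entry drifting into the neighbouring category, but an extreme misclassification would lead to a wrong selection decision.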