Optimizing the procedure of grain nutrient predictions in barley via hyperspectral imaging

PLoS One. 2019 Nov 7;14(11):e0224491. doi: 10.1371/journal.pone.0224491. eCollection 2019.

Abstract

Hyperspectral imaging enables researchers and plant breeders to analyze various traits of interest like nutritional value in high throughput. In order to achieve this, the optimal design of a reliable calibration model, linking the measured spectra with the investigated traits, is necessary. In the present study we investigated the impact of different regression models, calibration set sizes and calibration set compositions on prediction performance. For this purpose, we analyzed concentrations of six globally relevant grain nutrients of the wild barley population HEB-YIELD as case study. The data comprised 1,593 plots, grown in 2015 and 2016 at the locations Dundee and Halle, which have been entirely analyzed through traditional laboratory methods and hyperspectral imaging. The results indicated that a linear regression model based on partial least squares outperformed neural networks in this particular data modelling task. There existed a positive relationship between the number of samples in a calibration model and prediction performance, with a local optimum at a calibration set size of ~40% of the total data. The inclusion of samples from several years and locations could clearly improve the predictions of the investigated nutrient traits at small calibration set sizes. It should be stated that the expansion of calibration models with additional samples is only useful as long as they are able to increase trait variability. Models obtained in a certain environment were only to a limited extent transferable to other environments. They should therefore be successively upgraded with new calibration data to enable a reliable prediction of the desired traits. The presented results will assist the design and conceptualization of future hyperspectral imaging projects in order to achieve reliable predictions. It will in general help to establish practical applications of hyperspectral imaging systems, for instance in plant breeding concepts.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breeding / statistics & numerical data
  • Calibration
  • Edible Grain / growth & development
  • Edible Grain / metabolism*
  • Hordeum / growth & development
  • Hordeum / metabolism*
  • Least-Squares Analysis
  • Linear Models
  • Nutrients / genetics
  • Nutrients / metabolism*
  • Nutritive Value
  • Phenotype
  • Plant Structures / genetics
  • Plant Structures / metabolism*

Grants and funding

This work was financially supported by the German Research Foundation (DFG) via the priority program 1530: Flowering time control - from natural variation to crop improvement (grants Pi339/7-1 and Pi339/7-2 to MW, KP, AM) and via ERA-NET for Coordinating Action in Plant Sciences (ERA-CAPS) (grant Pi339/8-1 to WT, AF, KP, AM) and via the German Federal Ministry of Research and Education (BMBF) IPAS grant BARLEY-DIVERSITY (grant FZ 031A352A to AB, US, KP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.