Error propagation of partial least squares for parameters optimization in NIR modeling

Spectrochim Acta A Mol Biomol Spectrosc. 2018 Mar 5:192:244-250. doi: 10.1016/j.saa.2017.10.069. Epub 2017 Oct 28.

Abstract

A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type І and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models.

Keywords: Error propagation; Modeling parameters; Multivariate detection limits; Near-infrared; Partial least-squares.

MeSH terms

  • Gardenia / chemistry
  • Iridoids / analysis
  • Least-Squares Analysis
  • Limit of Detection
  • Models, Theoretical*
  • Multivariate Analysis
  • Reference Standards
  • Spectroscopy, Near-Infrared / methods*
  • Water / chemistry
  • Zea mays / chemistry

Substances

  • Iridoids
  • Water
  • geniposide