Dual stacked partial least squares for analysis of near-infrared spectra

Anal Chim Acta. 2013 Aug 20:792:19-27. doi: 10.1016/j.aca.2013.07.008. Epub 2013 Jul 9.

Abstract

A new ensemble learning algorithm is presented for quantitative analysis of near-infrared spectra. The algorithm contains two steps of stacked regression and Partial Least Squares (PLS), termed Dual Stacked Partial Least Squares (DSPLS) algorithm. First, several sub-models were generated from the whole calibration set. The inner-stack step was implemented on sub-intervals of the spectrum. Then the outer-stack step was used to combine these sub-models. Several combination rules of the outer-stack step were analyzed for the proposed DSPLS algorithm. In addition, a novel selective weighting rule was also involved to select a subset of all available sub-models. Experiments on two public near-infrared datasets demonstrate that the proposed DSPLS with selective weighting rule provided superior prediction performance and outperformed the conventional PLS algorithm. Compared with the single model, the new ensemble model can provide more robust prediction result and can be considered an alternative choice for quantitative analytical applications.

Keywords: Ensemble learning; Multivariate calibration; Near-infrared spectra; Partial least squares; Selective weighting rule.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Least-Squares Analysis*
  • Models, Statistical*
  • Spectroscopy, Near-Infrared*