META-PLS modelling: An integrated approach to automatic model optimization for near-infrared spectra

Anal Chim Acta. 2022 Aug 15:1221:340142. doi: 10.1016/j.aca.2022.340142. Epub 2022 Jul 11.

Abstract

Predictive latent space near-infrared (NIR) spectral modelling with PLS (Partial Least Squares) has two main tasks that require user input to achieve optimal models. The first is the selection of the optimal pre-processing of NIR spectra and the second is the selection of the optimal number of PLS model components assuming the data is outlier free. Often the two tasks are performed in an exhaustive search to find the best pre-processing as well as the optimal number of model components. We propose a novel approach called meta partial least square (META-PLS) which drops the need for both the pre-processing optimisation and exhaustive search for optimal model components. We utilise the stepwise nature of the PLS algorithm to learn complementary information from different pre-processed forms of the same data set as performed in multiblock pre-processing ensemble models to avoid pre-processing selection but receive help from the pre-processing ensembles, and deploy a weighted randomisation test to decide the optimal number of model components automatically. The performance of the approach for performing automatic NIR spectral modelling is demonstrated with several real data sets.

Keywords: Chemometrics; Multivariate; Non-destructive; Spectroscopy.

MeSH terms

  • Algorithms*
  • Least-Squares Analysis
  • Spectroscopy, Near-Infrared*