Statistical Aspects of Near-Infrared Spectroscopy for the Characterization of Errors and Model Building

Appl Spectrosc. 2017 Jul;71(7):1665-1676. doi: 10.1177/0003702817704587. Epub 2017 Apr 27.

Abstract

Due to the complex nature of near-infrared (NIR) spectra, it is usually very difficult to provide quantitative interpretations of spectral data. As a consequence, careful building and validation of calibration models are of fundamental importance prior to development of useful applications of NIR technologies. For this reason, this work presents a statistical study about the NIR spectroscopy, analyzing the NIR behavior when the experimental conditions are changed. Near-infrared spectra were measured at different temperatures and stirring velocities for systems containing a pure solvent and a suspension of polymer powder in order to perform the error analysis. Then, mixtures of xylene and toluene were analyzed through NIR at different temperatures and stirring velocities and the obtained data were used to build calibration models with multivariate techniques. The results showed that the precision of the NIR measurements depends on the analytical conditions and that unavoidable fluctuations of spectral data (or spectral data variability) are strongly correlated, leading to full covariance matrices of spectral fluctuations, which has been surprisingly neglected during quantitative analyses. In particular, modeling of the xylene/toluene NIR data performed with different multivariate techniques revealed that the principal directions are not preserved when the real covariance matrix of measurement errors is taken into account.

Keywords: NIR; Near-infrared spectroscopy; PCR; PLS; multivariate calibration; partial least squares; principal component regression; statistical error analysis.