Ensemble preprocessing of near-infrared (NIR) spectra for multivariate calibration

Anal Chim Acta. 2008 Jun 2;616(2):138-43. doi: 10.1016/j.aca.2008.04.031. Epub 2008 Apr 20.

Abstract

Preprocessing of raw near-infrared (NIR) spectral data is indispensable in multivariate calibration when the measured spectra are subject to significant noise, baseline effects and other undesirable factors. However, because sufficient prior information is lacking and knowledge of the raw data is incomplete, NIR spectral preprocessing in multivariate calibration is still a matter of trial and error. The choice of a proper method depends largely on both the nature of the data and the expertise and experience of the practitioner. This might limit the application of multivariate calibration in many fields where researchers are not familiar with the characteristics of the many preprocessing methods specific to chemometrics and have difficulty selecting the most suitable ones. Another problem is that many preprocessing methods, when used alone, might degrade the data in certain respects or lose useful information while improving other qualities of the data. To tackle these problems, this paper proposes a new concept of data preprocessing, the ensemble preprocessing method, in which partial least squares (PLS) models built on differently preprocessed data are combined by Monte Carlo cross validation (MCCV) stacked regression. Little or no prior information about the data, and little or no expertise, is required. Moreover, fusion of the complementary information obtained by different preprocessing methods often leads to a more stable and accurate calibration model. An investigation of two real data sets demonstrates the advantages of the proposed method.
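
The idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the three preprocessing methods (none, standard normal variate, first difference), the synthetic spectra, the number of PLS components, and the way the stacking weights are constrained (least squares, clipped to be non-negative and normalized to sum to one) are all assumptions chosen for illustration. All function names are hypothetical.

```python
import numpy as np

def raw(X):
    return X

def snv(X):
    # Standard normal variate: center and scale each spectrum (row).
    return (X - X.mean(axis=1, keepdims=True)) / X.std(axis=1, keepdims=True)

def first_difference(X):
    # Crude first derivative along the wavelength axis.
    return np.diff(X, axis=1)

def pls1_fit(X, y, n_components):
    # PLS1 by NIPALS; returns (x_mean, y_mean, regression coefficients).
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xr, yr = X - x_mean, y - y_mean
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr
        w /= np.linalg.norm(w)
        t = Xr @ w
        tt = t @ t
        p = Xr.T @ t / tt
        c = yr @ t / tt
        Xr = Xr - np.outer(t, p)   # deflate X
        yr = yr - c * t            # deflate y
        W.append(w); P.append(p); q.append(c)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)
    return x_mean, y_mean, coef

def pls1_predict(model, X):
    x_mean, y_mean, coef = model
    return (X - x_mean) @ coef + y_mean

def mccv_stack_weights(views, y, n_components=3, n_splits=50,
                       test_fraction=0.3, seed=0):
    # Pool held-out predictions of every member model over random splits,
    # then regress the true values on them to obtain combination weights.
    rng = np.random.default_rng(seed)
    n = len(y)
    n_test = max(1, int(round(test_fraction * n)))
    preds, targets = [], []
    for _ in range(n_splits):
        perm = rng.permutation(n)
        test, train = perm[:n_test], perm[n_test:]
        fold = [pls1_predict(pls1_fit(V[train], y[train], n_components), V[test])
                for V in views]
        preds.append(np.column_stack(fold))
        targets.append(y[test])
    Z, t = np.vstack(preds), np.concatenate(targets)
    w, *_ = np.linalg.lstsq(Z, t, rcond=None)
    w = np.clip(w, 0.0, None)      # assumption: non-negative weights
    if w.sum() == 0.0:
        return np.full(len(views), 1.0 / len(views))
    return w / w.sum()             # assumption: weights sum to one

# Synthetic "spectra": one analyte peak plus a random linear baseline and noise.
rng = np.random.default_rng(1)
n, p = 60, 80
grid = np.linspace(0.0, 1.0, p)
y = rng.uniform(0.0, 1.0, n)
peak = np.exp(-((grid - 0.5) ** 2) / 0.01)
baseline = rng.uniform(0, 2, (n, 1)) + rng.uniform(0, 2, (n, 1)) * grid
X = np.outer(y, peak) + baseline + rng.normal(0.0, 0.01, (n, p))

train, test = np.arange(40), np.arange(40, 60)
preprocessors = [raw, snv, first_difference]
views = [f(X) for f in preprocessors]   # all three transforms act per spectrum

weights = mccv_stack_weights([V[train] for V in views], y[train])
member_preds = np.column_stack([
    pls1_predict(pls1_fit(V[train], y[train], 3), V[test]) for V in views
])
y_hat = member_preds @ weights
rmse = float(np.sqrt(np.mean((y_hat - y[test]) ** 2)))
print("stacking weights:", weights, "test RMSE:", rmse)
```

Because each transform here operates on one spectrum at a time, applying it before the random splits does not leak information between calibration and validation samples; a column-wise method such as autoscaling would have to be refitted inside every split.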

MeSH terms

  • Calibration
  • Databases, Factual
  • Food Analysis / methods
  • Meat / analysis
  • Monte Carlo Method
  • Multivariate Analysis
  • Predictive Value of Tests
  • Regression Analysis
  • Reproducibility of Results
  • Seeds / chemistry
  • Sensitivity and Specificity
  • Signal Processing, Computer-Assisted*
  • Spectroscopy, Near-Infrared / instrumentation
  • Spectroscopy, Near-Infrared / methods*
  • Triticum / chemistry