Chemometrical strategies for feature selection and data compression applied to NIR and MIR spectra of extra virgin olive oils for cultivar identification

Talanta. 2010 Mar 15;80(5):1832-7. doi: 10.1016/j.talanta.2009.10.030. Epub 2009 Oct 24.

Abstract

The possibility provided by Chemometrics to extract and combine (fusion) information contained in NIR and MIR spectra in order to discriminate monovarietal extra virgin olive oils according to olive cultivar (Casaliva, Leccino, Frantoio) has been investigated. Linear discriminant analysis (LDA) was applied as a classification technique on these multivariate and non-specific spectral data both separately and jointly (NIR and MIR data together). In order to ensure a more appropriate ratio between the number of objects (samples) and number of variables (absorbance at different wavenumbers), LDA was preceded either by feature selection or variable compression. For feature selection, the SELECT algorithm was used while a wavelet transform was applied for data compression. Correct classification rates obtained by cross-validation varied between 60% and 90% depending on the followed procedure. Most accurate results were obtained using the fused NIR and MIR data, with either feature selection or data compression. Chemometrical strategies applied to fused NIR and MIR spectra represent an effective method for classification of extra virgin olive oils on the basis of the olive cultivar.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Olive Oil
  • Plant Oils / analysis*
  • Spectroscopy, Fourier Transform Infrared / methods*
  • Spectroscopy, Near-Infrared / methods*

Substances

  • Olive Oil
  • Plant Oils