Probabilistic partial least squares regression for quantitative analysis of Raman spectra

Int J Data Min Bioinform. 2015;11(2):223-43. doi: 10.1504/ijdmb.2015.066768.

Abstract

With the latest development of Surface-Enhanced Raman Scattering (SERS) technique, quantitative analysis of Raman spectra has shown the potential and promising trend of development in vivo molecular imaging. Partial Least Squares Regression (PLSR) is state-of-the-art method. But it only relies on training samples, which makes it difficult to incorporate complex domain knowledge. Based on probabilistic Principal Component Analysis (PCA) and probabilistic curve fitting idea, we propose a probabilistic PLSR (PPLSR) model and an Estimation Maximisation (EM) algorithm for estimating parameters. This model explains PLSR from a probabilistic viewpoint, describes its essential meaning and provides a foundation to develop future Bayesian nonparametrics models. Two real Raman spectra datasets were used to evaluate this model, and experimental results show its effectiveness.

Publication types

  • Evaluation Study
  • Validation Study

MeSH terms

  • Algorithms*
  • Complex Mixtures / analysis*
  • Complex Mixtures / chemistry*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Least-Squares Analysis
  • Models, Statistical*
  • Pattern Recognition, Automated / methods
  • Regression Analysis*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Spectrum Analysis, Raman / methods*

Substances

  • Complex Mixtures