Rapid differentiation of Ghana cocoa beans by FT-NIR spectroscopy coupled with multivariate classification

Spectrochim Acta A Mol Biomol Spectrosc. 2013 Oct:114:183-9. doi: 10.1016/j.saa.2013.05.063. Epub 2013 May 29.

Abstract

Quick, accurate and reliable technique for discrimination of cocoa beans according to geographical origin is essential for quality control and traceability management. This current study presents the application of Near Infrared Spectroscopy technique and multivariate classification for the differentiation of Ghana cocoa beans. A total of 194 cocoa bean samples from seven cocoa growing regions were used. Principal component analysis (PCA) was used to extract relevant information from the spectral data and this gave visible cluster trends. The performance of four multivariate classification methods: Linear discriminant analysis (LDA), K-nearest neighbors (KNN), Back propagation artificial neural network (BPANN) and Support vector machine (SVM) were compared. The performances of the models were optimized by cross validation. The results revealed that; SVM model was superior to all the mathematical methods with a discrimination rate of 100% in both the training and prediction set after preprocessing with Mean centering (MC). BPANN had a discrimination rate of 99.23% for the training set and 96.88% for prediction set. While LDA model had 96.15% and 90.63% for the training and prediction sets respectively. KNN model had 75.01% for the training set and 72.31% for prediction set. The non-linear classification methods used were superior to the linear ones. Generally, the results revealed that NIR Spectroscopy coupled with SVM model could be used successfully to discriminate cocoa beans according to their geographical origins for effective quality assurance.

Keywords: Ghana cocoa beans; Near Infrared Spectroscopy; Support vector machine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cacao / chemistry*
  • Discriminant Analysis
  • Ghana
  • Multivariate Analysis
  • Principal Component Analysis
  • Spectroscopy, Near-Infrared / methods*
  • Support Vector Machine