Alternative data mining/machine learning methods for the analytical evaluation of food quality and authenticity - A review

Ana M Jiménez-Carvelo; Antonio González-Casado; M Gracia Bagur-González; Luis Cuadros-Rodríguez

doi:10.1016/j.foodres.2019.03.063

Alternative data mining/machine learning methods for the analytical evaluation of food quality and authenticity - A review

Food Res Int. 2019 Aug:122:25-39. doi: 10.1016/j.foodres.2019.03.063. Epub 2019 Mar 28.

Authors

Ana M Jiménez-Carvelo¹, Antonio González-Casado², M Gracia Bagur-González², Luis Cuadros-Rodríguez²

Affiliations

¹ Department of Analytical Chemistry, Faculty of Science, University of Granada, C/ Fuentenueva s/n, E-18071 Granada, Spain. Electronic address: amariajc@ugr.es.
² Department of Analytical Chemistry, Faculty of Science, University of Granada, C/ Fuentenueva s/n, E-18071 Granada, Spain.

PMID: 31229078
DOI: 10.1016/j.foodres.2019.03.063

Abstract

In recent years, the variety and volume of data acquired by modern analytical instruments in order to conduct a better authentication of food has dramatically increased. Several pattern recognition tools have been developed to deal with the large volume and complexity of available trial data. The most widely used methods are principal component analysis (PCA), partial least squares-discriminant analysis (PLS-DA), soft independent modelling by class analogy (SIMCA), k-nearest neighbours (kNN), parallel factor analysis (PARAFAC), and multivariate curve resolution-alternating least squares (MCR-ALS). Nevertheless, there are alternative data treatment methods, such as support vector machine (SVM), classification and regression tree (CART) and random forest (RF), that show a great potential and more advantages compared to conventional ones. In this paper, we explain the background of these methods and review and discuss the reported studies in which these three methods have been applied in the area of food quality and authenticity. In addition, we clarify the technical terminology used in this particular area of research.

Keywords: CART; Data mining; Decision tree; Food analysis; Random forest.

Publication types

Review

MeSH terms

Data Mining / methods*
Decision Trees
Food Analysis / methods*
Food Quality*
Machine Learning*
Statistics as Topic