Milk Source Identification and Milk Quality Estimation Using an Electronic Nose and Machine Learning Techniques

Sensors (Basel). 2020 Jul 30;20(15):4238. doi: 10.3390/s20154238.

Abstract

In this study, an electronic nose (E-nose) consisting of seven metal oxide semiconductor sensors is developed to identify milk sources (dairy farms) and to estimate the content of milk fat and protein which are the indicators of milk quality. The developed E-nose is a low cost and non-destructive device. For milk source identification, the features based on milk odor features from E-nose, composition features (Dairy Herd Improvement, DHI analytical data) from DHI analysis and fusion features are analyzed by principal component analysis (PCA) and linear discriminant analysis (LDA) for dimension reduction and then three machine learning algorithms, logistic regression (LR), support vector machine (SVM), and random forest (RF), are used to construct the classification model of milk source (dairy farm) identification. The results show that the SVM model based on the fusion features after LDA has the best performance with the accuracy of 95%. Estimation model of the content of milk fat and protein from E-nose features using gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and random forest (RF) are constructed. The results show that the RF models give the best performance (R2 = 0.9399 for milk fat; R2 = 0.9301 for milk protein) and indicate that the proposed method in this study can improve the estimation accuracy of milk fat and protein, which provides a technical basis for predicting the quality of milk.

Keywords: electronic nose; milk; quality estimation; source identification.

Publication types

  • Letter

MeSH terms

  • Algorithms
  • Animals
  • Discriminant Analysis
  • Electronic Nose*
  • Food Analysis / instrumentation*
  • Logistic Models
  • Machine Learning*
  • Milk / chemistry*
  • Support Vector Machine