1H NMR chemometric models for classification of Czech wine type and variety

Food Chem. 2021 Mar 1:339:127852. doi: 10.1016/j.foodchem.2020.127852. Epub 2020 Aug 17.

Abstract

A set of 917 wines of Czech origin were analysed using nuclear magnetic resonance spectroscopy (NMR) with the aim of building and evaluating multivariate statistical models and machine learning methods for the classification of 6 types based on colour and residual sugar content, 13 wine grape varieties and 4 locations based on 1H NMR spectra. The predictive models afforded greater than 93% correctness for classifying dry and medium dry, medium, and sweet white wines and dry red wines. The trained Random Forest (RF) model classified Pinot noir with 96% correctness, Blaufränkisch 96%, Riesling 92%, Cabernet Sauvignon 77%, Chardonnay 76%, Gewürtztraminer 60%, Hibernal 60%, Grüner Veltliner 52%, Pinot gris 48%, Sauvignon Blanc 45%, and Pálava 40%. Pinot blanc and Chardonnay, varieties that are often mistakenly interchanged, were discriminated with 71% correctness. The findings support chemometrics as a tool for predicting important features in wine, particularly for quality assessment and fraud detection.

Keywords: (1)H NMR; 2,3-Butanediol (Compound CID: 262); Acetate (Compound CID: 175); Acetoacetate (Compound CID: 6971017); Acetoin (Compound CID: 179); Alanine (Compound CID: 5950); Catechin (Compound CID: 9064); Chemometrics; Chlorogenate (Compound CID: 1794427); Choline (Compound CID: 305); Epicatechin (Compound CID: 72276); Ethanol (Compound CID: 702); Ethyl Acetate (Compound CID: 8857); Formate (Compound CID: 283); Fructose (Compound CID: 2723872); Gallate (Compound CID: 370); Glucose (Compound CID: 5793); Glutamate (Compound CID: 33032); Glycerol (Compound CID: 753); Histidin (Compound CID: 6274); Isobutanol (PubChem CID: 6560); Isoleucine (Compound CID: 6306); Isopentanol (Compound CID: 31260); Lactate (Compound CID: 612); Leucine (Compound CID: 6106); Methanol (Compound CID: 887); Methionine (Compound CID: 876); Myo-Inositol (Compound CID: 892); Phenethyl alcohol (Compound CID: 57361413); Phenylalanine (Compound CID: 6140); Proline (Compound CID: 145742); Pyruvate (Compound CID: 107735); Succinate (Compound CID: 160419); Tartrate (Compound CID: 3806114); Trigonellin (Compound CID: 5570); Turanose (Compound CID: 5460935); Tyrosine (Compound CID: 6057); Uridine (Compound CID: 6029); Wine analysis; Wine classification; p-Hydroxyphenylacetic acid (Compound CID: 127); randomForest.

MeSH terms

  • Cluster Analysis
  • Color
  • Czech Republic
  • Discriminant Analysis
  • Principal Component Analysis
  • Proton Magnetic Resonance Spectroscopy / methods*
  • Vitis / chemistry
  • Vitis / metabolism
  • Wine / analysis*