Discrimination of corn variety using Terahertz spectroscopy combined with chemometrics methods

Spectrochim Acta A Mol Biomol Spectrosc. 2021 May 5:252:119475. doi: 10.1016/j.saa.2021.119475. Epub 2021 Jan 16.

Abstract

High-oil corn is a high-quality variety of corn possessing higher oil content with greater caloric energy than normal corn. Hence, controlling the purity and authenticity of high-oil corn is of great importance in current crop research. The aim of this study is to develop a novel method for corn variety discrimination using Terahertz (THz) spectroscopy and signal classification analysis. In brief, the method involves feature extraction and variable selection of raw signals from Terahertz time-domain waveforms (THz-TDW) and absorption spectrum (THz-AS), and the use of classifiers on those treated signals to establish the discrimination models. Principle component analysis (PCA) were used for feature extraction with THz-TDW, while three different methods of variable selection were implemented with THz-AS, including uninformative variables elimination (UVE), uninformative variables elimination-successive projections algorithm (UVE-SPA) and competitive adaptive reweighted sampling (CARS). Then, two classification algorithms, Linear discriminant analysis (LDA) and support vector machine (SVM), were employed and compared in the discrimination models. Bootstrapped Latin partitions (BLP) method with 10 bootstraps and 5 Latin-partitions was applied to validate these models. Our modeling results suggest SVM as the better classification algorithm achieving higher identifying accuracy, such that the PCA-SVM model for THz-TDW has achieved 94.7% accuracy. The results also indicate variable selection as an important step to create an accurate and robust discrimination model for THZ-AS. The CARS-SVM model with radial basic function (RBF) has achieved 100% average accuracy in prediction set, while the UVE-SVM and UVE-SPA-SVM have achieved 91.2% and 99.1% accuracy, respectively. These results demonstrate that high-oil corn and normal corn can be identified successfully by using THz spectroscopy with discriminant analysis, suggesting our techniques to provide an efficient and practical reference for classifying crop varieties in agriculture research, while expanding the application of THz spectroscopy in the related field.

Keywords: Corn variety discrimination; Feature extraction; SVM; Terahertz spectroscopy; Variables selection.

MeSH terms

  • Algorithms
  • Discriminant Analysis
  • Least-Squares Analysis
  • Principal Component Analysis
  • Support Vector Machine
  • Terahertz Spectroscopy*
  • Zea mays*