Distinguishing pericarpium citri reticulatae of different origins using terahertz time-domain spectroscopy combined with convolutional neural networks

Spectrochim Acta A Mol Biomol Spectrosc. 2023 Oct 15:299:122771. doi: 10.1016/j.saa.2023.122771. Epub 2023 Apr 25.

Abstract

The geographical indication of pericarpium citri reticulatae (PCR) is very important in grading the quality and price of PCRs. Therefore, terahertz time-domain spectroscopy (THz-TDS) technology combined with convolutional neural networks (CNN) was proposed to distinguish PCRs of different origins without damage in this study. The one-dimensional CNN (1D-CNN) model with an accuracy of 82.99% based on spectral data processed with SNV was established. The two-dimensional image features were transformed from unprocessed spectral data using the gramian angular field (GAF), the Markov transition field (MTF) and the recurrence plot (RP), which were used to build a two-dimensional CNN (2D-CNN) model with an accuracy of 78.33%. Further, the CNN models with different fusion methods were developed for fusing spectra data and image data. In addition, the adding spectra and images based on the CNN (Add-CNN) model with an accuracy of 86.17% performed better. Eventually, the Add-CNN model based on ten frequencies extracted using permutation importance (PI) achieved the identification of PCRs from different origins. Overall, the current study would provide a new method for identifying PCRs of different origins, which was expected to be used for the traceability of PCRs products.

Keywords: Convolutional neural networks; Origins; Pericarpium citri reticulatae; Terahertz time-domain spectroscopy; Two-dimension image features.

MeSH terms

  • Citrus* / chemistry
  • Fruit* / chemistry
  • Spectrum Analysis*