Wavenumber selection based on Singular Value Decomposition for sample classification

Forensic Sci Int. 2020 Apr:309:110191. doi: 10.1016/j.forsciint.2020.110191. Epub 2020 Feb 10.

Abstract

The dissemination of falsified medicines is a public health risk. Techniques such as attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy are commonly adopted for fraudulent drug detection. However, the spectrum generated by the ATR-FTIR typically results in hundreds of wavenumbers, reducing the performance of classification methods aimed at discriminating between authentic and falsified medicines. This article proposes a novel method for selecting a reduced size subset of wavenumbers that improves the classifier performance. The singular value decomposition SVD is used to generate a wavenumber importance index. An iterative process creates k-nearest neighbor (KNN) models by adding the wavenumbers in a decreasing order according to the importance index. Wavenumbers that increase classification accuracy are selected. When applied to Cialis® ATR-FTIR data, the proposed approach retained average 0.51% of the original wavenumbers with 100% accurate classifications; as for the Viagra® data set, the method yielded perfect classifications retaining average 0.17% of the original wavenumbers.

Keywords: ATR-FTIR; Falsified medicines; KNN; SVD; Wavenumber selection.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Counterfeit Drugs / chemistry*
  • Humans
  • Principal Component Analysis
  • Spectroscopy, Fourier Transform Infrared

Substances

  • Counterfeit Drugs