Comparison of feature selection and data fusion of Fourier transform infrared and Raman spectroscopy for identifying watercolor ink

J Forensic Sci. 2024 Mar;69(2):584-592. doi: 10.1111/1556-4029.15468. Epub 2024 Jan 30.

Abstract

The identification of different kinds of watercolor inks is an important work in the field of forensic science. Four different kinds of watercolor ink Spectroscopy data fusion strategies (Fourier Transform Infrared spectroscopy and Raman spectroscopy) combined with a non-linear classification model (Extreme Learning Machine) were used to identify the brand of watercolor inks. The study chose Competitive Adaptive Reweighted Sampling (CARS), Random Frog (RF), Variable Combination Population Analysis-Genetic Algorithm (VCPA-GA), and Variable Combination Population Analysis-Iteratively Retains Informative Variables (VCPA-IRIV) to extract characteristic variables for mid-level data fusion. The Cuckoo Search (CS) algorithm is used to optimize the extreme learning machine classification model. The results showed that the classification capacity of the mid-level fusion spectra model was more satisfactory than that of single Infrared spectroscopy or Raman spectroscopy. The CS-ELM models based on infrared spectroscopy used to recognize the watercolor ink according to brands (ZHENCAI, DELI, CHENGUANG, and STAEDTLER) obtained an accuracy of 66.67% in the test set using all spectral datasets. The accuracy of CS-ELM models based on Raman spectroscopy was 67.39%. The characteristic wavelength selection algorithms effectively improved the accuracy of the CS-ELM models. The classification accuracy of the mid-level spectroscopy fusion model combined with the VCPA-IRIV algorithm was 100%. The data fusion method increased effectively spectral information. The method could satisfactorily identify different brands of watercolor inks and support the preservation of artifacts, paintings, and forensic document examination.

Keywords: cuckoo search extreme learning machine; fourier transform infrared spectroscopy; mid-level data fusion; raman spectroscopy; watercolor ink.