Automatic classification of Candida species using Raman spectroscopy and machine learning

Spectrochim Acta A Mol Biomol Spectrosc. 2023 Apr 5:290:122270. doi: 10.1016/j.saa.2022.122270. Epub 2022 Dec 22.

Abstract

One of the problems that most affect hospitals is infections by pathogenic microorganisms. Rapid identification and adequate, timely treatment can avoid fatal consequences and the development of antibiotic resistance, so it is crucial to use fast, reliable, and not too laborious techniques to obtain quick results. Raman spectroscopy has proven to be a powerful tool for molecular analysis, meeting these requirements better than traditional techniques. In this work, we have used Raman spectroscopy combined with machine learning algorithms to explore the automatic identification of eleven species of the genus Candida, the most common cause of fungal infections worldwide. The Raman spectra were obtained from more than 220 different measurements of dried drops from pure cultures of each Candida species using a Raman Confocal Microscope with a 532 nm laser excitation source. After developing a spectral preprocessing methodology, a study of the quality and variability of the measured spectra at the isolate and species level, and the spectral features contributing to inter-class variations, showed the potential to discriminate between those pathogenic yeasts. Several machine learning and deep learning algorithms were trained using hyperparameter optimization techniques to find the best possible classifier for this spectral data, in terms of accuracy and lowest possible overfitting. We found that a one-dimensional Convolutional Neural Network (1-D CNN) could achieve above 80 % overall accuracy for the eleven classes spectral dataset, with good generalization capabilities.

Keywords: Candida identification; Convolutional neural network; Machine learning; Overfitting; Raman spectroscopy.

MeSH terms

  • Algorithms
  • Candida*
  • Machine Learning
  • Neural Networks, Computer
  • Spectrum Analysis, Raman*