Machine learning-based typing of Salmonella enterica O-serogroups by the Fourier-Transform Infrared (FTIR) Spectroscopy-based IR Biotyper system

J Microbiol Methods. 2022 Oct:201:106564. doi: 10.1016/j.mimet.2022.106564. Epub 2022 Sep 6.

Abstract

Background: Salmonella enterica is among the major burdens for public health at global level. Typing of salmonellae below the species level is fundamental for different purposes, but traditional methods are expensive, technically demanding, and time-consuming, and therefore limited to reference centers. Fourier transform infrared (FTIR) spectroscopy is an alternative method for bacterial typing, successfully applied for classification at different infra-species levels.

Aim: This study aimed to address the challenge of subtyping Salmonella enterica at O-serogroup level by using FTIR spectroscopy. We applied machine learning to develop a novel approach for S. enterica typing, using the FTIR-based IR Biotyper® system (IRBT; Bruker Daltonics GmbH & Co. KG, Germany). We investigated a multicentric collection of isolates, and we compared the novel approach with classical serotyping-based and molecular methods.

Methods: A total of 958 well characterized Salmonella isolates (25 serogroups, 138 serovars), collected in 11 different centers (in Europe and Japan), from clinical, environmental and food samples were included in this study and analyzed by IRBT. Infrared absorption spectra were acquired from water-ethanol bacterial suspensions, from culture isolates grown on seven different agar media. In the first part of the study, the discriminatory potential of the IRBT system was evaluated by comparison with reference typing method/s. In the second part of the study, the artificial intelligence capabilities of the IRBT software were applied to develop a classifier for Salmonella isolates at serogroup level. Different machine learning algorithms were investigated (artificial neural networks and support vector machine). A subset of 88 pre-characterized isolates (corresponding to 25 serogroups and 53 serovars) were included in the training set. The remaining 870 samples were used as validation set. The classifiers were evaluated in terms of accuracy, error rate and failed classification rate.

Results: The classifier that provided the highest accuracy in the cross-validation was selected to be tested with four external testing sets. Considering all the testing sites, accuracy ranged from 97.0% to 99.2% for non-selective media, and from 94.7% to 96.4% for selective media.

Conclusions: The IRBT system proved to be a very promising, user-friendly, and cost-effective tool for Salmonella typing at serogroup level. The application of machine learning algorithms proved to enable a novel approach for typing, which relies on automated analysis and result interpretation, and it is therefore free of potential human biases. The system demonstrated a high robustness and adaptability to routine workflows, without the need of highly trained personnel, and proving to be suitable to be applied with isolates grown on different agar media, both selective and unselective. Further tests with currently circulating clinical, food and environmental isolates would be necessary before implementing it as a potentially stand-alone standard method for routine use.

Keywords: FT-IR spectroscopy; FTIRS; IR Biotyper; Machine learning; Salmonella enterica; Salmonella typing.

MeSH terms

  • Agar
  • Artificial Intelligence
  • Bacterial Typing Techniques / methods
  • Culture Media
  • Ethanol
  • Humans
  • Machine Learning
  • Salmonella
  • Salmonella enterica*
  • Serogroup
  • Spectroscopy, Fourier Transform Infrared / methods
  • Water

Substances

  • Culture Media
  • Water
  • Ethanol
  • Agar