DiagnoTop: A Computational Pipeline for Discriminating Bacterial Pathogens without Database Search

J Am Soc Mass Spectrom. 2021 Jun 2;32(6):1295-1299. doi: 10.1021/jasms.1c00014. Epub 2021 Apr 15.

Abstract

Pathogen identification is crucial to confirm bacterial infections and guide antimicrobial therapy. Although MALDI-TOF mass spectrometry (MS) serves as foundation for tools that enable rapid microbial identification, some bacteria remain challenging to identify. We recently showed that top-down proteomics (TDP) could be used to discriminate closely related enterobacterial pathogens (Escherichia coli, Shigella, and Salmonella) that are indistinguishable with tools rooted in the MALDI-TOF MS approach. Current TDP diagnostic relies on the identification of specific proteoforms for each species through a database search. However, microbial proteomes are often poorly annotated, which complicates the large-scale identification of proteoforms and leads to many unidentified high-quality mass spectra. Here, we describe a new computational pipeline called DiagnoTop that lists discriminative spectral clusters found in TDP data sets that can be used for microbial diagnostics without database search. Applied to our enterobacterial TDP data sets, DiagnoTop could easily shortlist high-quality discriminative spectral clusters, leading to increased diagnostic power. This pipeline opens new perspectives in clinical microbiology and biomarker discovery using TDP.

Keywords: clinical microbiology; diagnostics; enterobacterial pathogens; top-down proteomics.

MeSH terms

  • Bacteria / chemistry*
  • Bacteria / pathogenicity*
  • Computational Biology / methods*
  • Databases, Factual
  • Enterobacteriaceae / chemistry
  • Enterobacteriaceae / pathogenicity
  • Knowledge Bases
  • Proteomics / methods
  • Software*
  • Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
  • Tandem Mass Spectrometry / methods*
  • Workflow