ProteoStorm: An Ultrafast Metaproteomics Database Search Framework

Cell Syst. 2018 Oct 24;7(4):463-467.e6. doi: 10.1016/j.cels.2018.08.009. Epub 2018 Sep 26.

Abstract

Shotgun metaproteomics has the potential to reveal the functional landscape of microbial communities but lacks appropriate methods for complex samples with unknown compositions. In the absence of prior taxonomic information, tandem mass spectra would be searched against large pan-microbial databases, which requires heavy computational workload and reduces sensitivity. We present ProteoStorm, an efficient database search framework for large-scale metaproteomics studies, which identifies high-confidence peptide-spectrum matches (PSMs) while achieving a two-to-three orders-of-magnitude speedup over popular tools. A reanalysis of a urinary tract infection (UTI) dataset of 110 individuals revealed a complex pattern of polymicrobial expression, including sub-types of UTIs, cases of bacterial vaginosis, and evidence of no underlying disease. Importantly, compared to the initial UTI study that restricted the search database to a manually curated list of 20 genera, ProteoStorm identified additional genera that were previously unreported, including a case of infection with the rare pathogen Propionimicrobium.

Keywords: LC-MS/MS; metaproteomics; microbial communities; microbiology; proteome informatics; proteomics.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Databases, Genetic
  • Humans
  • Metagenome*
  • Microbiota / genetics
  • Proteomics / methods*
  • Proteomics / standards
  • Software*
  • Urinary Tract Infections / microbiology