Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics

J Proteome Res. 2017 Sep 1;16(9):3209-3218. doi: 10.1021/acs.jproteome.7b00198. Epub 2017 Aug 22.

Abstract

Complex mass spectrometry based proteomics data sets are mostly analyzed by protein database searches. While this approach performs considerably well for sequenced organisms, direct inference of peptide sequences from tandem mass spectra, i.e., de novo peptide sequencing, oftentimes is the only way to obtain information when protein databases are absent. However, available algorithms suffer from drawbacks such as lack of validation and often high rates of false positive hits (FP). Here we present a simple method of combining results from commonly available de novo peptide sequencing algorithms, which in conjunction with minor tweaks in data acquisition ensues lower empirical FDR compared to the analysis using single algorithms. Results were validated using state-of-the art database search algorithms as well specifically synthesized reference peptides. Thus, we could increase the number of PSMs meeting a stringent FDR of 5% more than 3-fold compared to the single best de novo sequencing algorithm alone, accounting for an average of 11 120 PSMs (combined) instead of 3476 PSMs (alone) in triplicate 2 h LC-MS runs of tryptic HeLa digestion.

Keywords: LC−MS/MS; bottom-up proteomics; de novo peptide sequencing; false discovery rate.

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Animals
  • Cell Line
  • Chromatography, Liquid
  • Databases, Protein
  • HeLa Cells
  • Humans
  • Mice
  • Myoblasts / chemistry
  • Myoblasts / metabolism
  • Peptides / analysis*
  • Proteolysis
  • Proteomics / instrumentation
  • Proteomics / methods*
  • Saccharomyces cerevisiae / chemistry
  • Saccharomyces cerevisiae / metabolism
  • Sequence Analysis, Protein / methods*
  • Snails / chemistry
  • Snails / metabolism
  • Tandem Mass Spectrometry
  • Trypsin / chemistry

Substances

  • Peptides
  • Trypsin