Refining comparative proteomics by spectral counting to account for shared peptides and multiple search engines

Anal Bioanal Chem. 2012 Sep;404(4):1115-25. doi: 10.1007/s00216-012-6011-x. Epub 2012 May 3.

Abstract

Spectral counting has become a widely used approach for measuring and comparing protein abundance in label-free shotgun proteomics. However, when analyzing complex samples, the ambiguity of matching between peptides and proteins greatly affects the assessment of peptide and protein inventories, differentiation, and quantification. Meanwhile, the configuration of database searching algorithms that assign peptides to MS/MS spectra may produce different results in comparative proteomic analysis. Here, we present three strategies to improve comparative proteomics through spectral counting. We show that comparing spectral counts for peptide groups rather than for protein groups forestalls problems introduced by shared peptides. We demonstrate the advantage and flexibility of this new method in two datasets. We present four models to combine four popular search engines that lead to significant gains in spectral counting differentiation. Among these models, we demonstrate a powerful vote counting model that scales well for multiple search engines. We also show that semi-tryptic searching outperforms tryptic searching for comparative proteomics. Overall, these techniques considerably improve protein differentiation on the basis of spectral count tables.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Databases, Protein
  • Escherichia coli Proteins / chemistry*
  • Escherichia coli Proteins / genetics
  • Humans
  • Peptides / chemistry*
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteomics / methods*
  • Search Engine / methods*
  • Software

Substances

  • Escherichia coli Proteins
  • Peptides
  • Proteins