Data analysis strategy for maximizing high-confidence protein identifications in complex proteomes such as human tumor secretomes and human serum

J Proteome Res. 2011 Nov 4;10(11):4993-5005. doi: 10.1021/pr200464c. Epub 2011 Oct 18.

Abstract

Detection of biologically interesting, low-abundance proteins in complex proteomes such as serum typically requires extensive fractionation and high-performance mass spectrometers. Processing of the resulting large data sets involves trade-offs between confidence of identification and depth of protein coverage; that is, higher stringency filters preferentially reduce the number of low-abundance proteins identified. In the current study, an alternative database search and results filtering strategies were evaluated using test samples ranging from purified proteins to ovarian tumor secretomes and human serum to maximize peptide and protein coverage. Full and partial tryptic searches were compared because substantial numbers of partial tryptic peptides were observed in all samples, and the proportion of partial tryptic peptides was particularly high for serum. When data filters that yielded similar false discovery rates (FDR) were used, full tryptic searches detected far fewer peptides than partial tryptic searches. In contrast to the common practice of using full tryptic specificity and a narrow precursor mass tolerance, more proteins and peptides could be confidently identified using a partial tryptic database search with a 100 ppm precursor mass tolerance followed by filtering of results using 10 ppm mass error and full tryptic boundaries.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence
  • Blood Proteins / chemistry*
  • Data Interpretation, Statistical*
  • False Positive Reactions
  • Female
  • Humans
  • Molecular Sequence Data
  • Neoplasm Proteins / chemistry
  • Neoplasm Proteins / metabolism*
  • Ovarian Neoplasms / chemistry
  • Ovarian Neoplasms / metabolism*
  • Peptide Fragments / chemistry
  • Protein Processing, Post-Translational
  • Proteolysis
  • Proteome / chemistry*
  • Tandem Mass Spectrometry

Substances

  • Blood Proteins
  • Neoplasm Proteins
  • Peptide Fragments
  • Proteome