Combining results of multiple search engines in proteomics

Mol Cell Proteomics. 2013 Sep;12(9):2383-93. doi: 10.1074/mcp.R113.027797. Epub 2013 May 29.

Abstract

A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence of the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; each engine provides unique correct identifications in addition to a core set of identifications shared with the other engines. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques.
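
As a rough illustration of the idea of combining engine outputs, the following minimal sketch counts how many engines agree on a peptide for each spectrum. It is not one of the methods covered by the review, which may rely on engine scores and statistical modeling; it only shows simple agreement counting. The function name `combine_psms`, the engine labels, the spectrum identifiers, and the peptide sequences are all hypothetical and chosen purely for illustration.

```python
from collections import Counter, defaultdict


def combine_psms(results):
    """Combine peptide-spectrum matches by simple agreement counting.

    results: {engine_name: {spectrum_id: peptide}} (hypothetical input layout)
    Returns {spectrum_id: {"peptide", "support", "engines_reporting"}}.
    """
    votes = defaultdict(Counter)
    for psms in results.values():
        for spectrum_id, peptide in psms.items():
            votes[spectrum_id][peptide] += 1

    combined = {}
    for spectrum_id, counter in votes.items():
        ranked = counter.most_common()
        best_peptide, best_count = ranked[0]
        # If two peptides have equal top support, report no consensus.
        tied = len(ranked) > 1 and ranked[1][1] == best_count
        combined[spectrum_id] = {
            "peptide": None if tied else best_peptide,
            "support": best_count,
            "engines_reporting": sum(counter.values()),
        }
    return combined


# Made-up example: three engines, three spectra.
example = {
    "engine_a": {"s1": "PEPTIDEK", "s3": "AAAK"},
    "engine_b": {"s1": "PEPTIDEK", "s3": "GGGK"},
    "engine_c": {"s1": "PEPTIDEK", "s2": "LVNELTEFAK"},
}

for spectrum_id, summary in sorted(combine_psms(example).items()):
    print(spectrum_id, summary)
```

In this toy input, one spectrum is identified identically by all three engines (the shared core), one is reported by a single engine (a unique identification), and one receives conflicting assignments, for which the sketch reports no consensus peptide. A practical combiner would also need to weigh each engine's score and estimate error rates, which this sketch deliberately omits.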

Publication types

  • Research Support, American Recovery and Reinvestment Act
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Humans
  • Peptides / metabolism
  • Proteomics / methods*
  • ROC Curve
  • Search Engine*
  • Software

Substances

  • Peptides