ProteinProcessor: A probabilistic analysis using mass accuracy and the MS spectrum

Proteomics. 2016 Sep;16(18):2480-90. doi: 10.1002/pmic.201600137.

Abstract

Current approaches to protein identification rely heavily on database matching of fragmentation spectra or precursor peptide ions. We have developed a method for MALDI TOF-TOF instrumentation that uses peptide masses and their measurement errors to confirm protein identifications from a first-pass MS/MS database search. The method uses MS1-level spectral data that have heretofore been ignored by most search engines. It uses the distribution of mass errors of peptide matches in the MS1 spectrum to develop a probability model that is independent of the MS/MS database search identifications. Peptide mass matches can come both from precursor ions that have been fragmented and from ions that are tentatively identified by accurate mass alone. This additional corroboration enables us to confirm protein identifications whose MS/MS-based scores would otherwise be considered only of moderate quality. Straightforward and easily applicable to current proteomic analyses, this tool, termed "ProteinProcessor," provides a robust and invaluable addition to current protein identification tools.
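
The abstract does not give the form of the probability model, so the following is only an illustrative sketch of how MS1 mass errors could corroborate a candidate protein in a Bayesian way. It is not the ProteinProcessor algorithm. The Gaussian error model for true matches, the uniform error model for chance matches, and all names and parameter values (ppm_error, posterior_correct, sigma_ppm, window_ppm, prior) are assumptions made for illustration.

```python
import math


def ppm_error(observed_mz, theoretical_mz):
    """Signed mass error of an observed peak in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def posterior_correct(errors_ppm, prior=0.5, sigma_ppm=5.0, window_ppm=50.0):
    """Posterior probability that a candidate protein is correct.

    Assumed model (hypothetical, not taken from the paper):
      * true peptide matches have errors ~ Normal(0, sigma_ppm)
      * chance matches have errors ~ Uniform(-window_ppm, +window_ppm)
      * peptide matches are treated as independent
    """
    log_odds = math.log(prior / (1.0 - prior))
    log_uniform = -math.log(2.0 * window_ppm)
    log_norm_const = -math.log(sigma_ppm * math.sqrt(2.0 * math.pi))
    for e in errors_ppm:
        log_gauss = -0.5 * (e / sigma_ppm) ** 2 + log_norm_const
        # Each matched peptide contributes a log likelihood ratio.
        log_odds += log_gauss - log_uniform
    # Numerically stable logistic transform of the log odds.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    z = math.exp(log_odds)
    return z / (1.0 + z)


if __name__ == "__main__":
    tight = [1.0, -2.0, 0.5]        # errors clustered near zero
    scattered = [40.0, -35.0, 45.0]  # errors spread across the window
    print(posterior_correct(tight))
    print(posterior_correct(scattered))
```

Under these assumptions, matches whose errors cluster near zero raise the posterior, while matches scattered across the tolerance window lower it, which mirrors the idea of using the MS1 mass-error distribution as evidence independent of the MS/MS score.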

Keywords: Bayesian analysis; Bioinformatics; MALDI; Protein sequencing; Tandem mass spectrometry.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms*
  • Animals
  • Databases, Protein
  • Humans
  • Mice
  • Models, Statistical
  • Peptide Mapping / methods*
  • Proteomics / methods*
  • Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
  • Tandem Mass Spectrometry / methods*