Brute-Force Approach for Mass Spectrometry-Based Variant Peptide Identification in Proteogenomics without Personalized Genomic Data

J Am Soc Mass Spectrom. 2018 Feb;29(2):435-438. doi: 10.1007/s13361-017-1859-9. Epub 2018 Jan 3.

Abstract

In a proteogenomic approach based on tandem mass spectrometry analysis of proteolytic peptide mixtures, customized exome or RNA-seq databases are employed for identifying protein sequence variants. However, the problem of variant peptide identification without personalized genomic data is important for a variety of applications. Following the recent proposal by Chick et al. (Nat. Biotechnol. 33, 743-749, 2015) on the feasibility of such variant peptide search, we evaluated two available approaches based on the previously suggested "open" search and the "brute-force" strategy. To improve the efficiency of these approaches, we propose an algorithm for exclusion of false variant identifications from the search results involving analysis of modifications mimicking single amino acid substitutions. Also, we propose a de novo based scoring scheme for assessment of identified point mutations. In the scheme, the search engine analyzes y-type fragment ions in MS/MS spectra to confirm the location of the mutation in the variant peptide sequence. Graphical abstract ᅟ.

Keywords: LC-MS/MS; Open search; Proteogenomics; Search engine.

MeSH terms

  • Algorithms
  • Amino Acid Substitution*
  • Animals
  • Chromatography, Liquid
  • Databases, Protein
  • Humans
  • Peptides / chemistry
  • Peptides / genetics*
  • Point Mutation
  • Proteins / chemistry
  • Proteins / genetics
  • Proteogenomics / methods*
  • Search Engine
  • Tandem Mass Spectrometry

Substances

  • Peptides
  • Proteins