Precision De Novo Peptide Sequencing Using Mirror Proteases of Ac-LysargiNase and Trypsin for Large-scale Proteomics

Mol Cell Proteomics. 2019 Apr;18(4):773-785. doi: 10.1074/mcp.TIR118.000918. Epub 2019 Jan 8.

Abstract

De novo peptide sequencing for large-scale proteomics remains challenging because of the lack of full coverage of ion series in tandem mass spectra. We developed a mirror protease of trypsin, acetylated LysargiNase (Ac-LysargiNase), with superior activity and stability. The mirror spectrum pairs derived from the Ac-LysargiNase and trypsin treated samples can generate full b and y ion series, which provide mutual complementarity of each other, and allow us to develop a novel algorithm, pNovoM, for de novo sequencing. Using pNovoM to sequence peptides of purified proteins, the accuracy of the sequence was close to 100%. More importantly, from a large-scale yeast proteome sample digested with trypsin and Ac-LysargiNase individually, 48% of all tandem mass spectra formed mirror spectrum pairs, 97% of which contained full coverage of ion series, resulting in precision de novo sequencing of full-length peptides by pNovoM. This enabled pNovoM to successfully sequence 21,249 peptides from 3,753 proteins and interpreted 44-152% more spectra than pNovo+ and PEAKS at a 5% FDR at the spectrum level. Moreover, the mirror protease strategy had an obvious advantage in sequencing long peptides. We believe that the combination of mirror protease strategy and pNovoM will be an effective approach for precision de novo sequencing on both single proteins and proteome samples.

Keywords: Ac-LysargiNase; De novo sequencing; Enzyme catalysis*; Mass Spectrometry; Mirror proteases; Peptide mass fingerprinting; Protein engineering; Trypsin.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acetylation
  • Amino Acid Sequence
  • Antibodies, Monoclonal / metabolism
  • Enzyme Stability
  • Metalloproteases / metabolism*
  • Peptides / chemistry
  • Peptides / metabolism*
  • Proteome / metabolism
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods*
  • Trypsin / metabolism*

Substances

  • Antibodies, Monoclonal
  • Peptides
  • Proteome
  • Metalloproteases
  • Trypsin