Validation of Top-Down Proteomics Data by Bottom-Up-Based N-Terminomics Reveals Pitfalls in Top-Down-Based Terminomics Workflows

J Proteome Res. 2022 Sep 2;21(9):2185-2196. doi: 10.1021/acs.jproteome.2c00277. Epub 2022 Aug 16.

Abstract

Bottom-up proteomics (BUP)-based N-terminomics techniques have become standard to identify protein N-termini. While these methods rely on the identification of N-terminal peptides only, top-down proteomics (TDP) comes with the promise to provide additional information about post-translational modifications and the respective C-termini. To evaluate the potential of TDP for terminomics, two established TDP workflows were employed for the proteome analysis of the nematode Caenorhabditis elegans. The N-termini of the identified proteoforms were validated using a BUP-based N-terminomics approach. The TDP workflows used here identified 1658 proteoforms, the N-termini of which were verified by BUP in 25% of entities only. Caveats in both the BUP- and TDP-based workflows were shown to contribute to this low overlap. In BUP, the use of trypsin prohibits the detection of arginine-rich or arginine-deficient N-termini, while in TDP, the formation of artificially generated termini was observed in particular in a workflow encompassing sample treatment with high acid concentrations. Furthermore, we demonstrate the applicability of reductive dimethylation in TDP to confirm biological N-termini. Overall, our study shows not only the potential but also current limitations of TDP for terminomics studies and also presents suggestions for future developments, for example, for data quality control, allowing improvement of the detection of protein termini by TDP.

Keywords: HUNTER; LC−MS; N-termini; bottom-up proteomics; data analysis; orbitrap; proteoforms; proteolysis; terminomics; top-down proteomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arginine
  • DNA-Binding Proteins
  • Protein Processing, Post-Translational
  • Proteome* / analysis
  • Proteomics* / methods
  • Workflow

Substances

  • DNA-Binding Proteins
  • Proteome
  • Arginine