Multiple Layers of Complexity in O-Glycosylation Illustrated With the Urinary Glycoproteome

Mol Cell Proteomics. 2022 Dec;21(12):100439. doi: 10.1016/j.mcpro.2022.100439. Epub 2022 Nov 9.

Abstract

While N-glycopeptides are relatively easy to characterize, O-glycosylation analysis is more complex. In this article, we illustrate the multiple layers of O-glycopeptide characterization that make this task so challenging. We believe our carefully curated dataset represents perhaps the largest intact human glycopeptide mixture derived from individuals, not from cell lines. The samples were collected from healthy individuals, patients with superficial or advanced bladder cancer (three of each group), and a single bladder inflammation patient. The data were scrutinized manually and interpreted using three different search engines: Byonic, Protein Prospector, and O-Pair, and the tool MS-Filter. Despite all the recent advances, reliable automatic O-glycopeptide assignment has not been solved yet. Our data reveal such diversity of site-specific O-glycosylation that has not been presented before. In addition to the potential biological implications, this dataset should be a valuable resource for software developers in the same way as some of our previously released data has been used in the development of O-Pair and O-Glycoproteome Analyzer. Based on the manual evaluation of the performance of the existing tools with our data, we lined up a series of recommendations that if implemented could significantly improve the reliability of glycopeptide assignments.

Keywords: EThcD; mass spectrometry; misidentification; search engines; urinary O-glycopeptides.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Glycopeptides / analysis
  • Glycosylation
  • Humans
  • Proteome / chemistry
  • Reproducibility of Results
  • Search Engine*
  • Software*

Substances

  • Glycopeptides
  • Proteome