GlyComb: A novel glycoconjugate data repository that bridges glycomics and proteomics

J Biol Chem. 2024 Feb;300(2):105624. doi: 10.1016/j.jbc.2023.105624. Epub 2024 Jan 3.

Abstract

The glycosylation of proteins and lipids is known to be closely related to the mechanisms of various diseases such as influenza, cancer, and muscular dystrophy. Therefore, it has become clear that the analysis of post-translational modifications of proteins, including glycosylation, is important to accurately understand the functions of each protein molecule and the interactions among them. In order to conduct large-scale analyses more efficiently, it is essential to promote the accumulation, sharing, and reuse of experimental and analytical data in accordance with the FAIR (Findability, Accessibility, Interoperability, and Re-usability) data principles. However, a FAIR data repository for storing and sharing glycoconjugate information, including glycopeptides and glycoproteins, in a standardized format did not exist. Therefore, we have developed GlyComb (https://glycomb.glycosmos.org) as a new standardized data repository for glycoconjugate data. Currently, GlyComb can assign a unique identifier to a set of glycosylation information associated with a specific peptide sequence or UniProt ID. By standardizing glycoconjugate data via GlyComb identifiers and coordinating with existing web resources such as GlyTouCan and GlycoPOST, a comprehensive system for data submission and data sharing among researchers can be established. Here we introduce how GlyComb is able to integrate the variety of glycoconjugate data already registered in existing data repositories to obtain a better understanding of the available glycopeptides and glycoproteins, and their glycosylation patterns. We also explain how this system can serve as a foundation for a better understanding of glycan function.

Keywords: bioinformatics; glycobiology; glycoconjugate; glycomics; glycoprotein; proteomics; repository.

MeSH terms

  • Databases, Chemical*
  • Databases, Genetic
  • Glycomics*
  • Glycopeptides / metabolism
  • Glycoproteins / metabolism
  • Glycosylation
  • Polysaccharides / metabolism
  • Proteomics*

Substances

  • Glycopeptides
  • Glycoproteins
  • Polysaccharides