Dealing with the Ambiguity of Glycan Substructure Search

Molecules. 2021 Dec 23;27(1):65. doi: 10.3390/molecules27010065.

Abstract

The level of ambiguity in describing glycan structure has significantly increased with the upsurge of large-scale glycomics and glycoproteomics experiments. Consequently, an ontology-based model appears as an appropriate solution for navigating these data. However, navigation is not sufficient and the model should also enable advanced search and comparison. A new ontology with a tree logical structure is introduced to represent glycan structures irrespective of the precision of molecular details. The model heavily relies on the GlycoCT encoding of glycan structures. Its implementation in the GlySTreeM knowledge base was validated with GlyConnect data and benchmarked with the Glycowork library. GlySTreeM is shown to be fast, consistent, reliable and more flexible than existing solutions for matching parts of or whole glycan structures. The model is also well suited for painless future expansion.

Keywords: glycan structure; knowledge representation; ontology; pattern recognition; semantic web.

MeSH terms

  • Databases, Factual
  • Glycomics / methods*
  • Molecular Structure
  • Polysaccharides / chemistry*
  • Structure-Activity Relationship
  • Web Browser

Substances

  • Polysaccharides