Informatics Ecosystems to Advance the Biology of Glycans

Methods Mol Biol. 2022:2303:655-673. doi: 10.1007/978-1-0716-1398-6_50.

Abstract

Glycomics researchers have identified the need for integrated database systems for collecting glycomics information in a consistent format. The goal is to create a resource for knowledge discovery and dissemination to wider research communities. This has the potential and has exhibited initial success, to extend the research community to include biologists, clinicians, chemists, and computer scientists. This chapter discusses the technology and approach needed to create integrated data resources and informatics ecosystems to empower the broader community to leverage extant glycomics data. The focus is on glycosaminoglycan (GAGs) and proteoglycan research, but the approach can be generalized. The methods described span the development of glycomics standards from CarbBank to Glyco Connection Tables. Integrated data sets provide a foundation for novel methods of analysis such as machine learning and deep learning for knowledge discovery. The implications of predictive analysis are examined in relation to disease biomarker to expand the target audience of GAG and proteoglycan research.

Keywords: Data integration; Data representation; Deep learning; Glycosaminoglycan; Informatics; Machine learning; Proteoglycan.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Ecosystem*
  • Glycomics*
  • Informatics
  • Polysaccharides
  • Proteoglycans

Substances

  • Polysaccharides
  • Proteoglycans