Informatics tools to advance the biology of glycosaminoglycans and proteoglycans

Methods Mol Biol. 2015:1229:271-87. doi: 10.1007/978-1-4939-1714-3_23.

Abstract

Glycomics researchers have identified the need for integrated database systems for collecting glycomics information in a consistent format. The goal is to create a resource for knowledge discovery and dissemination to wider research communities. This has the potential to extend the research community to include biologists, clinicians, chemists, and computer scientists. This chapter discusses the technology and approach needed to create integrated data resources to empower the broader community to leverage extant glycomics data. The focus is on glycosaminoglycan (GAGs) and proteoglycan research, but the approach can be generalized. The methods described span the development of glycomics standards from CarbBank to Glyco Connection Tables. The existence of integrated data sets provides a foundation for novel methods of analysis such as machine learning for knowledge discovery. The implications of predictive analysis are examined in relation to disease biomarker to expand the target audience of GAG and proteoglycan research.

MeSH terms

  • Artificial Intelligence
  • Computational Biology / methods*
  • Glycosaminoglycans / chemistry*
  • Models, Molecular
  • Proteoglycans / chemistry*
  • Systems Integration

Substances

  • Glycosaminoglycans
  • Proteoglycans