EnzymeML-a data exchange format for biocatalysis and enzymology

FEBS J. 2022 Oct;289(19):5864-5874. doi: 10.1111/febs.16318. Epub 2021 Dec 26.

Abstract

EnzymeML is an XML-based data exchange format that supports the comprehensive documentation of enzymatic data by describing reaction conditions, time courses of substrate and product concentrations, the kinetic model, and the estimated kinetic constants. EnzymeML is based on the Systems Biology Markup Language, which was extended by implementing the STRENDA Guidelines. An EnzymeML document serves as a container to transfer data between experimental platforms, modeling tools, and databases. EnzymeML supports the scientific community by introducing a standardized data exchange format to make enzymatic data findable, accessible, interoperable, and reusable according to the FAIR data principles. An application programming interface in Python supports the integration of software tools for data acquisition, data analysis, and publication. The feasibility of a seamless data flow using EnzymeML is demonstrated by creating an EnzymeML document from a structured spreadsheet or from a STRENDA DB database entry, by kinetic modeling using the modeling platform COPASI, and by uploading to the enzymatic reaction kinetics database SABIO-RK.

Keywords: FAIR data principles; Python; Systems Biology Markup Language; XML; biocatalysis; bioinformatics; data exchange; enzymology; research data management.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biocatalysis
  • Databases, Factual
  • Software*