Enhanced reproducibility of SADI web service workflows with Galaxy and Docker

Gigascience. 2015 Dec 3:4:59. doi: 10.1186/s13742-015-0092-3. eCollection 2015.

Abstract

Background: Semantic Web technologies have been widely applied in the life sciences, for example by data providers such as OpenLifeData and through web services frameworks such as SADI. The recently reported OpenLifeData2SADI project offers access to the vast OpenLifeData data store through SADI services.

Findings: This article describes how to merge data retrieved from OpenLifeData2SADI with other SADI services using the Galaxy bioinformatics analysis platform, thus making this semantic data more amenable to complex analyses. This is demonstrated using a working example, which is made distributable and reproducible through a Docker image that includes SADI tools, along with the data and workflows that constitute the demonstration.

Conclusions: The combination of Galaxy and Docker offers a solution for faithfully reproducing and sharing complex data retrieval and analysis workflows based on the SADI Semantic web service design patterns.

Keywords: Docker; Galaxy; RDF; Reproducibility; SADI; Semantic Web; Web service; Workflow.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Information Dissemination
  • Information Storage and Retrieval* / methods
  • Reproducibility of Results
  • Software*