Querying clinical data in HL7 RIM based relational model with morph-RDB

J Biomed Semantics. 2017 Oct 5;8(1):49. doi: 10.1186/s13326-017-0155-8.

Abstract

Background: Semantic interoperability is essential when carrying out post-genomic clinical trials where several institutions collaborate, since researchers and developers need to have an integrated view and access to heterogeneous data sources. One possible approach to accommodate this need is to use RDB2RDF systems that provide RDF datasets as the unified view. These RDF datasets may be materialized and stored in a triple store, or transformed into RDF in real time, as virtual RDF data sources. Our previous efforts involved materialized RDF datasets, hence losing data freshness.

Results: In this paper we present a solution that uses an ontology based on the HL7 v3 Reference Information Model and a set of R2RML mappings that relate this ontology to an underlying relational database implementation, and where morph-RDB is used to expose a virtual, non-materialized SPARQL endpoint over the data.

Conclusions: By applying a set of optimization techniques on the SPARQL-to-SQL query translation algorithm, we can now issue SPARQL queries to the underlying relational data with generally acceptable performance.

Keywords: Clinical data; R2RML; SPARQL.

MeSH terms

  • Algorithms
  • Biological Ontologies*
  • Databases, Factual*
  • Humans
  • Information Storage and Retrieval / methods*
  • Internet
  • Semantics
  • Software*