A Customizable Importer for the Clinical Data Warehouses PaDaWaN and I2B2

Stud Health Technol Inform. 2017:243:90-94.

Abstract

In recent years, clinical data warehouses (CDWs) that store routine patient data have become increasingly popular for supporting scientific work in the medical domain. Although CDW systems provide interfaces for importing new data, these interfaces must be driven by processing tools that are often not included in the systems themselves. To establish an extraction-transformation-load (ETL) workflow, existing components must be adopted or new ones developed to perform the load step. We present a customizable importer for the two CDW systems PaDaWaN and I2B2 that handles the most common import formats (plain text, CSV, and XML files). To run, the importer needs only a configuration file with the user credentials for the target CDW and a list of XML import configuration files, which determine how already exported data is intended to be imported. The importer is provided as a Java program with no further software requirements.
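
The abstract does not specify the layout of the importer's configuration file. As a rough illustration of the idea only, the sketch below reads a Java properties-style file that holds the target CDW, the user credentials, and a comma-separated list of XML import configuration files. The class name, the property keys (`cdw.target`, `cdw.user`, `cdw.password`, `import.configs`), and the file names are all hypothetical and are not taken from the actual importer.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Illustrative sketch only: NOT the importer's real configuration format.
public class ImporterConfigSketch {

    /** Hypothetical holder for the settings described in the abstract. */
    static final class Settings {
        final String target;              // e.g. "padawan" or "i2b2" (assumed values)
        final String user;
        final String password;
        final List<String> importConfigs; // paths of XML import configuration files

        Settings(String target, String user, String password, List<String> importConfigs) {
            this.target = target;
            this.user = user;
            this.password = password;
            this.importConfigs = importConfigs;
        }
    }

    /** Parses properties-style text; all keys are assumptions made for this sketch. */
    static Settings parse(Reader reader) throws IOException {
        Properties props = new Properties();
        props.load(reader);

        List<String> configs = new ArrayList<>();
        for (String path : require(props, "import.configs").split(",")) {
            String trimmed = path.trim();
            if (!trimmed.isEmpty()) {
                configs.add(trimmed);
            }
        }
        return new Settings(
                require(props, "cdw.target"),
                require(props, "cdw.user"),
                require(props, "cdw.password"),
                configs);
    }

    /** Returns the trimmed value for a key, or fails if it is missing or blank. */
    private static String require(Properties props, String key) {
        String value = props.getProperty(key);
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalArgumentException("Missing setting: " + key);
        }
        return value.trim();
    }

    public static void main(String[] args) throws IOException {
        String example = "cdw.target=i2b2\n"
                + "cdw.user=demo\n"
                + "cdw.password=secret\n"
                + "import.configs=labs.xml, diagnoses.xml\n";
        Settings settings = parse(new StringReader(example));
        System.out.println(settings.target + " " + settings.importConfigs);
    }
}
```

Keeping credentials and the list of import configurations in one small file mirrors the abstract's claim that nothing else is needed to run the importer; how each XML import configuration maps exported data into the CDW is not described in the abstract and is left out here.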

Keywords: ETL; data warehouse.

MeSH terms

  • Database Management Systems*
  • Electronic Health Records
  • Humans
  • Software*