Development of an electronic health records datamart to support clinical and population health research

J Clin Transl Sci. 2020 Jun 23;5(1):e13. doi: 10.1017/cts.2020.499.

Abstract

Introduction: Electronic health record (EHR) data have emerged as an important resource for population health and clinical research. There have been significant efforts to leverage EHR data for research; however, given data security concerns and the complexity of the data, EHR data are frequently difficult to access and use for clinical studies. We describe the development of a Clinical Research Datamart (CRDM) created to provide well-curated and easily accessible EHR data to Duke University investigators.

Methods: The CRDM was designed to (1) contain most of the patient-level data elements needed for research studies; (2) be directly accessible by individuals conducting statistical analyses (including Biostatistics, Epidemiology, and Research Design (BERD) core members); (3) be queried via a code-based system to promote reproducibility and consistency across studies; and (4) utilize a secure protected analytic workspace in which sensitive EHR data can be stored and analyzed. The CRDM uses data transformed for the PCORnet data network and is augmented with data tables containing site-specific data elements that provide additional contextual information.

Results: We provide descriptions of ideal use cases and discuss dissemination and evaluation methods, including future work to expand the user base and track the use and impact of this data resource.

Conclusions: The CRDM utilizes resources developed as part of the Clinical and Translational Science Awards (CTSAs) program and could be replicated by other institutions with CTSAs.

Keywords: Electronic health records; PCORnet; common data model.