Presenting and sharing clinical data using the eTRIKS Standards Master Tree for tranSMART

Bioinformatics. 2019 May 1;35(9):1562-1565. doi: 10.1093/bioinformatics/bty809.

Abstract

Motivation: Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into the tranSMART i2b2 via a guiding master ontology tree positively impacts and supports the efficacy of data sharing, visualization and exploration across datasets.

Results: We present here a schema for the organization of SDTM variables into the tranSMART i2b2 tree along with a script and test dataset to exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated by making use of fictitious data generated for four patients, including 16 SDTM clinical domains. We describe how the usage of correct visit names and data labels can help to integrate multiple readouts per patient and avoid ETL crashes when running a tranSMART loading routine.

Availability and implementation: The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098 and a functional demo installation at https://public.etriks.org/transmart/datasetExplorer/ under eTRIKS-Master Tree branch, where the discussed examples can be visualized.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Accuracy
  • Data Collection
  • Humans
  • Information Dissemination
  • Information Storage and Retrieval*