The technology behind TB DEPOT: a novel public analytics platform integrating tuberculosis clinical, genomic, and radiological data for visual and statistical exploration

J Am Med Inform Assoc. 2021 Jan 15;28(1):71-79. doi: 10.1093/jamia/ocaa228.

Abstract

Objective: Clinical research informatics tools are necessary to support comprehensive studies of infectious diseases. The National Institute of Allergy and Infectious Diseases (NIAID) developed the publicly accessible Tuberculosis Data Exploration Portal (TB DEPOT) to address the complex etiology of tuberculosis (TB).

Materials and methods: TB DEPOT displays deidentified patient case data and facilitates analyses across a wide range of clinical, socioeconomic, genomic, and radiological factors. The solution is built using Amazon Web Services cloud-based infrastructure, .NET Core, Angular, Highcharts, R, PLINK, and other custom-developed services. Structured patient data, pathogen genomic variants, and medical images are integrated into the solution to allow seamless filtering across data domains.
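
As an illustration of filtering across data domains, the following minimal Python sketch joins clinical, genomic, and radiological attributes on a single patient record and applies criteria from all three domains in one query. It is not TB DEPOT's implementation (the platform itself is built on .NET Core, Angular, and R). The `PatientCase` class, the `filter_cases` function, and all field names and sample values are hypothetical and were chosen only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class PatientCase:
    # Hypothetical record layout; not the TB DEPOT schema.
    patient_id: str
    country: str                                      # clinical / socioeconomic domain
    resistance_type: str                              # clinical domain
    variants: set = field(default_factory=set)        # genomic domain
    image_findings: set = field(default_factory=set)  # radiological domain


def filter_cases(cases, country=None, variant=None, finding=None):
    """Return the cases that satisfy every criterion that is not None."""
    cohort = []
    for case in cases:
        if country is not None and case.country != country:
            continue
        if variant is not None and variant not in case.variants:
            continue
        if finding is not None and finding not in case.image_findings:
            continue
        cohort.append(case)
    return cohort


# Made-up sample records for illustration only.
cases = [
    PatientCase("P1", "A", "MDR", {"geneA_mut1"}, {"cavity"}),
    PatientCase("P2", "A", "Sensitive", set(), {"nodule"}),
    PatientCase("P3", "B", "MDR", {"geneA_mut1", "geneB_mut2"}, {"cavity"}),
]

# One query spanning the genomic and radiological domains.
cohort = filter_cases(cases, variant="geneA_mut1", finding="cavity")
print([c.patient_id for c in cohort])  # prints ['P1', 'P3']
```

Keeping all domains keyed to one patient record is what lets a single filter call combine criteria that would otherwise live in separate data stores. A saved cohort, in this sketch, would simply be the list of returned patient identifiers.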

Results: Researchers can use TB DEPOT to query TB patient cases, create and save patient cohorts, and execute comparative statistical analyses on demand. The tool supports user-driven data exploration and fulfills the National Institutes of Health's Findable, Accessible, Interoperable, and Reusable (FAIR) principles.

Discussion: TB DEPOT is the first tool of its kind in the field of TB research to integrate multidimensional data from TB patient cases. Its scalable and flexible architectural design has accommodated growth in data volume, data types, organizations, feature requests, and usage. Two important lessons learned were to favor client-side over server-side technologies and to prioritize maintenance. Future directions are dynamically prioritized, and key functionality is shared through an application programming interface.

Conclusion: This paper describes the platform development methodology, resulting functionality, benefits, and technical considerations of a clinical research informatics application to support increased understanding of TB.

Keywords: clinical research informatics; cohort creation; data analysis; data integration; tuberculosis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology
  • Databases as Topic
  • Genomics
  • Humans
  • Internet*
  • Medical Informatics Applications*
  • National Institute of Allergy and Infectious Diseases (U.S.)
  • Radiology
  • Software
  • Tuberculosis* / diagnosis
  • Tuberculosis* / drug therapy
  • Tuberculosis* / genetics
  • Tuberculosis* / prevention & control
  • United States