Building a collaborative cloud platform to accelerate heart, lung, blood, and sleep research

J Am Med Inform Assoc. 2023 Jun 20;30(7):1293-1300. doi: 10.1093/jamia/ocad048.

Abstract

Research increasingly relies on interrogating large-scale data resources. The NIH National Heart, Lung, and Blood Institute developed the NHLBI BioData CatalystⓇ (BDC), a community-driven ecosystem where researchers, including bench and clinical scientists, statisticians, and algorithm developers, find, access, share, store, and compute on large-scale datasets. This ecosystem provides secure, cloud-based workspaces, user authentication and authorization, search, tools and workflows, applications, and new innovative features to address community needs, including exploratory data analysis, genomic and imaging tools, tools for reproducibility, and improved interoperability with other NIH data science platforms. BDC offers straightforward access to large-scale datasets and computational resources that support precision medicine for heart, lung, blood, and sleep conditions, leveraging separately developed and managed platforms to maximize flexibility based on researcher needs, expertise, and backgrounds. Through the NHLBI BioData Catalyst Fellows Program, BDC facilitates scientific discoveries and technological advances. BDC also facilitated accelerated research on the coronavirus disease-2019 (COVID-19) pandemic.

Keywords: cloud computing; data analysis; data-driven science; reproducibility of results; team science.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • COVID-19*
  • Cloud Computing*
  • Ecosystem
  • Humans
  • Lung
  • Reproducibility of Results
  • Software