Big data management challenges in health research-a literature review

Brief Bioinform. 2019 Jan 18;20(1):156-167. doi: 10.1093/bib/bbx086.

Abstract

Big data management for information centralization (i.e. making data of interest findable) and integration (i.e. making related data connectable) in health research is a defining challenge in biomedical informatics. While essential to create a foundation for knowledge discovery, optimized solutions to deliver high-quality and easy-to-use information resources are not thoroughly explored. In this review, we identify the gaps between current data management approaches and the need for new capacity to manage big data generated in advanced health research. Focusing on these unmet needs and well-recognized problems, we introduce state-of-the-art concepts, approaches and technologies for data management from computing academia and industry to explore improvement solutions. We explain the potential and significance of these advances for biomedical informatics. In addition, we discuss specific issues that have a great impact on technical solutions for developing the next generation of digital products (tools and data) to facilitate the raw-data-to-knowledge process in health research.

Publication types

  • Research Support, N.I.H., Intramural
  • Review

MeSH terms

  • Big Data*
  • Computational Biology / methods*
  • Computational Biology / statistics & numerical data
  • Computational Biology / trends
  • Database Management Systems / statistics & numerical data
  • Database Management Systems / trends
  • Humans
  • Knowledge Bases
  • Machine Learning
  • Research / statistics & numerical data