The development of large-scale de-identified biomedical databases in the age of genomics: principles and challenges

Hum Genomics. 2018 Apr 10;12(1):19. doi: 10.1186/s40246-018-0147-5.

Abstract

Contemporary biomedical databases include a wide range of information types from various observational and instrumental sources. Among the most important features that unite biomedical databases across the field are a high volume of information and a high potential for harm through data corruption, loss of performance, and loss of patient privacy. Thus, data governance and privacy protection are essential to the construction of data depositories for biomedical research and healthcare. In this paper, we discuss the challenges of data governance in the context of population genome projects. These challenges, along with best practices and current research efforts, are discussed across the steps of data collection, storage, sharing, analysis, and knowledge dissemination.

Keywords: Biomedical database; Data governance; Data privacy; Whole genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Biomedical Research / trends*
  • Databases, Genetic*
  • Genomics*
  • Humans