The Korea Cancer Big Data Platform (K-CBP) for Cancer Research

Int J Environ Res Public Health. 2019 Jun 28;16(13):2290. doi: 10.3390/ijerph16132290.

Abstract

Data warehousing is the most important technology to address recent advances in precision medicine. However, a generic clinical data warehouse does not address unstructured and insufficient data. In precision medicine, it is essential to develop a platform that can collect and utilize data. Data were collected from electronic medical records, genomic sequences, tumor biopsy specimens, and national cancer control initiative databases in the National Cancer Center (NCC), Korea. Data were de-identified and stored in a safe and independent space. Unstructured clinical data were standardized and incorporated into cancer registries and linked to cancer genome sequences and tumor biopsy specimens. Finally, national cancer control initiative data from the public domain were independently organized and linked to cancer registries. We constructed a system for integrating and providing various cancer data called the Korea Cancer Big Data Platform (K-CBP). Although the K-CBP could be used for cancer research, the legal and regulatory aspects of data distribution and usage need to be addressed first. Nonetheless, the system will continue collecting data from cancer-related resources that will hopefully facilitate precision-based research.

Keywords: big data platform; cancer data; clinical cancer registry; de-identification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Big Data*
  • Databases, Factual*
  • Electronic Health Records*
  • Humans
  • Neoplasms / therapy*
  • Precision Medicine
  • Registries
  • Republic of Korea