Current Trends of Big Data Research Using the Korean National Health Information Database

Diabetes Metab J. 2022 Jul;46(4):552-563. doi: 10.4093/dmj.2022.0193. Epub 2022 Jul 27.

Abstract

Medical research using big data has grown rapidly in recent years, and its value is increasingly recognized. The Korean National Health Information Database (NHID) is a representative big data resource that combines two sources: information collected by the National Health Insurance Service for claims and reimbursement of health care services, and results of the general health examinations provided to all Korean adults. The database has both strengths and limitations. Given its large size, the laboratory data and questionnaires obtained from medical check-ups, its longitudinal nature, and the long-term accumulation of data since 2002, carefully designed studies may provide valuable information that is difficult to obtain from other forms of research. However, because the data were not collected for research purposes, possible bias must be considered and causal relationships interpreted with care. Since the NHID became publicly available, research and publications based on it have increased sharply, especially in the field of diabetes and metabolism. This article reviews the history, structure, and characteristics of the Korean NHID. It also introduces recent trends in big data research using this database, commonly used operational diagnoses, and representative studies. We expect further progress and expansion of big data research using the Korean NHID.

Keywords: Database; Diabetes mellitus; Korea; Metabolism; National health programs.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Big Data*
  • Databases, Factual
  • National Health Programs*
  • Republic of Korea / epidemiology