The assessment of data quality issues for process mining in healthcare using Medical Information Mart for Intensive Care III, a freely available e-health record database

Health Informatics J. 2019 Dec;25(4):1878-1893. doi: 10.1177/1460458218810760. Epub 2018 Nov 29.

Abstract

There is a growing body of literature on process mining in healthcare. Process mining of electronic health record systems could give benefit into better understanding of the actual processes happened in the patient treatment, from the event log of the hospital information system. Researchers report issues of data access approval, anonymisation constraints, and data quality. One solution to progress methodology development is to use a high-quality, freely available research dataset such as Medical Information Mart for Intensive Care III, a critical care database which contains the records of 46,520 intensive care unit patients over 12 years. Our article aims to (1) explore data quality issues for healthcare process mining using Medical Information Mart for Intensive Care III, (2) provide a structured assessment of Medical Information Mart for Intensive Care III data quality and challenge for process mining, and (3) provide a worked example of cancer treatment as a case study of process mining using Medical Information Mart for Intensive Care III to illustrate an approach and solution to data quality challenges. The electronic health record software was upgraded partway through the period over which data was collected and we use this event to explore the link between electronic health record system design and resulting process models.

Keywords: Medical Information Mart for Intensive Care III; data quality; healthcare; process mining.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Critical Care / methods
  • Critical Care / standards
  • Critical Care / statistics & numerical data
  • Data Accuracy*
  • Data Management / instrumentation
  • Data Management / methods
  • Data Management / statistics & numerical data
  • Data Mining / methods
  • Data Mining / standards*
  • Data Mining / statistics & numerical data
  • Delivery of Health Care / methods
  • Delivery of Health Care / standards
  • Delivery of Health Care / statistics & numerical data
  • Electronic Health Records / statistics & numerical data
  • Humans
  • Telemedicine / methods
  • Telemedicine / standards*
  • Telemedicine / statistics & numerical data