Automated data mining of the electronic health record for investigation of healthcare-associated outbreaks

Infect Control Hosp Epidemiol. 2019 Mar;40(3):314-319. doi: 10.1017/ice.2018.343. Epub 2019 Feb 18.

Abstract

Background: Identifying routes of transmission among hospitalized patients during a healthcare-associated outbreak can be tedious, particularly among patients with complex hospital stays and multiple exposures. Data mining of the electronic health record (EHR) has the potential to rapidly identify common exposures among patients suspected of being part of an outbreak.

Methods: We retrospectively analyzed 9 hospital outbreaks that occurred during 2011-2016 and that had previously been characterized both according to transmission route and by molecular characterization of the bacterial isolates. We determined (1) the ability of data mining of the EHR to identify the correct route of transmission, (2) how early the correct route was identified during the timeline of the outbreak, and (3) how many cases in the outbreaks could have been prevented had the system been running in real time.

Results: Correct routes were identified for all outbreaks at the second patient, except for one outbreak involving >1 transmission route that was detected at the eighth patient. Up to 40 or 34 infections (78% or 66% of possible preventable infections, respectively) could have been prevented if data mining had been implemented in real time, assuming the initiation of an effective intervention within 7 or 14 days of identification of the transmission route, respectively.

Conclusions: Data mining of the EHR was accurate for identifying routes of transmission among patients who were part of the outbreak. Prospective validation of this approach using routine whole-genome sequencing and data mining of the EHR for both outbreak detection and route attribution is ongoing.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cross Infection / transmission*
  • Data Mining / methods*
  • Data Mining / statistics & numerical data
  • Disease Outbreaks / prevention & control*
  • Electronic Health Records / statistics & numerical data
  • Female
  • Hospitals / statistics & numerical data
  • Humans
  • Male
  • Retrospective Studies