Diagnosis Prediction from Electronic Health Records Using the Binary Diagnosis History Vector Representation

J Comput Biol. 2017 Aug;24(8):767-786. doi: 10.1089/cmb.2017.0023. Epub 2017 Jul 7.

Abstract

Large amounts of rich, heterogeneous information nowadays routinely collected by healthcare providers across the world possess remarkable potential for the extraction of novel medical data and the assessment of different practices in real-world conditions. Specifically in this work, our goal is to use electronic health records (EHRs) to predict progression patterns of future diagnoses of ailments for a particular patient, given the patient's present diagnostic history. Following the highly promising results of a recently proposed approach that introduced the diagnosis history vector representation of a patient's diagnostic record, we introduce a series of improvements to the model and conduct thorough experiments that demonstrate its scalability, accuracy, and practicability in the clinical context. We show that the model is able to capture well the interaction between a large number of ailments that correspond to the most frequent diagnoses, show how the original learning framework can be adapted to increase its prediction specificity, and describe a principled, probabilistic method for incorporating explicit, human clinical knowledge to overcome semantic limitations of the raw EHR data.

Keywords: Bayesian; EMRs; disease; electronic medical records; epidemiology; risk.

MeSH terms

  • Databases, Factual*
  • Diagnosis, Computer-Assisted / methods*
  • Electronic Health Records*
  • Humans
  • Models, Theoretical
  • Patient Admission
  • Prognosis