EMR-based medical knowledge representation and inference via Markov random fields and distributed representation learning

Artif Intell Med. 2018 May:87:49-59. doi: 10.1016/j.artmed.2018.03.005. Epub 2018 Apr 23.

Abstract

Objective: Electronic medical records (EMRs) contain medical knowledge that can be used for clinical decision support (CDS). Our objective is to develop a general system that can extract and represent knowledge contained in EMRs to support three CDS tasks-test recommendation, initial diagnosis, and treatment plan recommendation-given the condition of a patient.

Methods: We extracted four kinds of medical entities from records and constructed an EMR-based medical knowledge network (EMKN), in which nodes are entities and edges reflect their co-occurrence in a record. Three bipartite subgraphs (bigraphs) were extracted from the EMKN, one to support each task. One part of the bigraph was the given condition (e.g., symptoms), and the other was the condition to be inferred (e.g., diseases). Each bigraph was regarded as a Markov random field (MRF) to support the inference. We proposed three graph-based energy functions and three likelihood-based energy functions. Two of these functions are based on knowledge representation learning and can provide distributed representations of medical entities. Two EMR datasets and three metrics were utilized to evaluate the performance.

Results: As a whole, the evaluation results indicate that the proposed system outperformed the baseline methods. The distributed representation of medical entities does reflect similarity relationships with respect to knowledge level.

Conclusion: Combining EMKN and MRF is an effective approach for general medical knowledge representation and inference. Different tasks, however, require individually designed energy functions.

Keywords: Clinical decision support; Distributed representation; Electronic medical record; Markov random field; Medical knowledge network.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Decision Support Systems, Clinical*
  • Electronic Health Records*
  • Likelihood Functions
  • Machine Learning*
  • Markov Chains