A proof of concept for assessing emergency room use with primary care data and natural language processing

Methods Inf Med. 2013;52(1):33-42. doi: 10.3414/ME12-01-0012. Epub 2012 Dec 7.

Abstract

Objective: The objective of this study was to undertake a proof of concept that demonstrated the use of primary care data and natural language processing and term extraction to assess emergency room use. The study extracted biopsychosocial concepts from primary care free text and related them to inappropriate emergency room use through the use of odds ratios.

Methods: De-identified free text notes were extracted from a primary care clinic in Guelph, Ontario and analyzed with a software toolkit that incorporated General Architecture for Text Engineering (GATE) and MetaMap components for natural language processing and term extraction.

Results: Over 10 million concepts were extracted from 13,836 patient records. Codes found in at least 1% percent of the sample were regressed against inappropriate emergency room use. 77 codes fell within the realm of biopsychosocial, were very statistically significant (p < 0.001) and had an OR > 2.0. Thematically, these codes involved mental health and pain related concepts.

Conclusions: Analyzed thematically, mental health issues and pain are important themes; we have concluded that pain and mental health problems are primary drivers for inappropriate emergency room use. Age and sex were not significant. This proof of concept demonstrates the feasibly of combining natural language processing and primary care data to analyze a system use question. As a first work it supports further research and could be applied to investigate other, more complex problems.

MeSH terms

  • Computer Systems
  • Emergency Service, Hospital / statistics & numerical data*
  • Feasibility Studies
  • Health Services Misuse / statistics & numerical data
  • Humans
  • International Classification of Diseases
  • Medical Records Systems, Computerized
  • Mental Disorders / epidemiology
  • Mental Disorders / therapy
  • Natural Language Processing*
  • Ontario
  • Pain / epidemiology
  • Pain / etiology
  • Primary Health Care / statistics & numerical data*
  • Risk Factors
  • Software
  • Utilization Review / statistics & numerical data