Using Natural Language Processing to Predict Risk in Electronic Health Records

Stud Health Technol Inform. 2024 Jan 25:310:574-578. doi: 10.3233/SHTI231030.

Abstract

Clinical narratives recording behaviours and emotions of patients are available from EHRs in a forensic psychiatric centre located in Tasmania. This rich data has not been used in risk prediction. Prior work demonstrates natural language processing can be used to identify patient symptoms in these free-text records and can then be used to predict risk. Four dictionaries containing descriptive words of harm were created using the Diagnostic and Statistical Manual of Mental Disorders, the Unified Medical Language System repository, English negative and positive sentiment words, and high-frequency words from the Corpus of Contemporary American English. However, a model based only on these keywords is limited in predictive power. In this study, we introduce an improved NLP approach with a social interaction component to extract additional information about the behavioural and emotional state of patients. These social interactions are subsequently used in a machine-learning model to enhance risk prediction performance.

Keywords: Natural language processing; electronic health record; mental health.

MeSH terms

  • Electronic Health Records*
  • Emotions
  • Health Facilities
  • Humans
  • Language
  • Natural Language Processing*