Automated classification of primary care patient safety incident report content and severity using supervised machine learning (ML) approaches

Huw Prosser Evans; Athanasios Anastasiou; Adrian Edwards; Peter Hibbert; Meredith Makeham; Saturnino Luz; Aziz Sheikh; Liam Donaldson; Andrew Carson-Stevens

doi:10.1177/1460458219833102

Automated classification of primary care patient safety incident report content and severity using supervised machine learning (ML) approaches

Health Informatics J. 2020 Dec;26(4):3123-3139. doi: 10.1177/1460458219833102. Epub 2019 Mar 7.

Authors

Huw Prosser Evans¹, Athanasios Anastasiou², Adrian Edwards¹, Peter Hibbert³, Meredith Makeham⁴, Saturnino Luz, Aziz Sheikh⁵, Liam Donaldson⁶, Andrew Carson-Stevens⁷

Affiliations

¹ Cardiff University, UK.
² Swansea University, UK.
³ Macquarie University, Australia; University of South Australia, Australia.
⁴ Macquarie University, Australia.
⁵ The University of Edinburgh, UK.
⁶ London School of Hygiene & Tropical Medicine, UK.
⁷ Cardiff University, UK; Macquarie University, Australia.

PMID: 30843455
DOI: 10.1177/1460458219833102

Abstract

Learning from patient safety incident reports is a vital part of improving healthcare. However, the volume of reports and their largely free-text nature poses a major analytic challenge. The objective of this study was to test the capability of autonomous classifying of free text within patient safety incident reports to determine incident type and the severity of harm outcome. Primary care patient safety incident reports (n=31333) previously expert-categorised by clinicians (training data) were processed using J48, SVM and Naïve Bayes.The SVM classifier was the highest scoring classifier for incident type (AUROC, 0.891) and severity of harm (AUROC, 0.708). Incident reports containing deaths were most easily classified, correctly identifying 72.82% of reports. In conclusion, supervised ML can be used to classify patient safety incident report categories. The severity classifier, whilst not accurate enough to replace manual processing, could provide a valuable screening tool for this critical aspect of patient safety.

Keywords: incident reporting; machine learning; natural language processing; patient safety; quality improvement.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Humans
Patient Safety*
Primary Health Care
Supervised Machine Learning
Support Vector Machine*