ChatGPT's performance before and after teaching in mass casualty incident triage

Rick Kye Gan; Helal Uddin; Ann Zee Gan; Ying Ying Yew; Pedro Arcos González

doi:10.1038/s41598-023-46986-0

ChatGPT's performance before and after teaching in mass casualty incident triage

Sci Rep. 2023 Nov 21;13(1):20350. doi: 10.1038/s41598-023-46986-0.

Authors

Rick Kye Gan¹, Helal Uddin^{2

3

4}, Ann Zee Gan⁵, Ying Ying Yew¹, Pedro Arcos González¹

Affiliations

¹ Unit for Research in Emergency and Disaster, Faculty of Medicine and Health Sciences, University of Oviedo, 33006, Oviedo, Spain.
² Unit for Research in Emergency and Disaster, Faculty of Medicine and Health Sciences, University of Oviedo, 33006, Oviedo, Spain. md.helal.uddin@stud.ki.se.
³ Department of Global Public Health, Karolinska Institute, 17177, Solna, Sweden. md.helal.uddin@stud.ki.se.
⁴ Department of Sociology, East West University, Dhaka, 1212, Bangladesh. md.helal.uddin@stud.ki.se.
⁵ Tenghilan Health Clinic, 89208, Tuaran, Sabah, Malaysia.

Abstract

Since its initial launching, ChatGPT has gained significant attention from the media, with many claiming that ChatGPT's arrival is a transformative milestone in the advancement of the AI revolution. Our aim was to assess the performance of ChatGPT before and after teaching the triage of mass casualty incidents by utilizing a validated questionnaire specifically designed for such scenarios. In addition, we compared the triage performance between ChatGPT and medical students. Our cross-sectional study employed a mixed-methods analysis to assess the performance of ChatGPT in mass casualty incident triage, pre- and post-teaching of Simple Triage And Rapid Treatment (START) triage. After teaching the START triage algorithm, ChatGPT scored an overall triage accuracy of 80%, with only 20% of cases being over-triaged. The mean accuracy of medical students on the same questionnaire yielded 64.3%. Qualitative analysis on pre-determined themes on 'walking-wounded', 'respiration', 'perfusion', and 'mental status' on ChatGPT showed similar performance in pre- and post-teaching of START triage. Additional themes on 'disclaimer', 'prediction', 'management plan', and 'assumption' were identified during the thematic analysis. ChatGPT exhibited promising results in effectively responding to mass casualty incident questionnaires. Nevertheless, additional research is necessary to ensure its safety and efficacy before clinical implementation.

MeSH terms

Algorithms
Computer Simulation
Cross-Sectional Studies
Humans
Mass Casualty Incidents*
Triage* / methods