Reshaping medical education: Performance of ChatGPT on a PES medical examination

Simona Wójcik; Anna Rulkiewicz; Piotr Pruszczyk; Wojciech Lisik; Marcin Poboży; Justyna Domienik-Karłowicz

doi:10.5603/cj.97517

Reshaping medical education: Performance of ChatGPT on a PES medical examination

Cardiol J. 2023 Oct 13. doi: 10.5603/cj.97517. Online ahead of print.

Authors

Simona Wójcik¹, Anna Rulkiewicz¹, Piotr Pruszczyk², Wojciech Lisik³, Marcin Poboży⁴, Justyna Domienik-Karłowicz^{5

6}

Affiliations

¹ LUX MED Llc, Warsaw, Poland.
² Department of Internal Medicine and Cardiology with The Center for Diagnosis and Treatment of Thromboembolism, Medical University of Warsaw, Poland.
³ Department of General and Transplantation Surgery, Medical University of Warsaw, Poland.
⁴ Cichowski Pobozy Healthcare Facility, Maciejowice, Poland.
⁵ Department of Internal Medicine and Cardiology with The Center for Diagnosis and Treatment of Thromboembolism, Medical University of Warsaw, Poland. jdomienik@tlen.pl.
⁶ LUX MED Llc, Warsaw, Poland. jdomienik@tlen.pl.

PMID: 37830257
DOI: 10.5603/cj.97517

Abstract

Background: We are currently experiencing a third digital revolution driven by artificial intelligence (AI), and the emergence of new chat generative pre-trained transformer (ChatGPT) represents a significant technological advancement with profound implications for global society, especially in the field of education.

Methods: The aim of this study was to see how well ChatGPT performed on medical school exams and to highlight how it might change medical education and practice. Recently, OpenAI's ChatGPT (OpenAI, San Francisco; GPT-4 May 24 Version) was put to the test against a significant Polish medical specialization licensing exam (PES), and the results are in. The version of ChatGPT-4 used in this study was the most up-to-date model at the time of publication (GPT-4). ChatGPT answered questions from June 28, 2023, to June 30, 2023.

Results: ChatGPT demonstrates notable advancements in natural language processing models on the tasks of medical question answering. In June 2023, the performance of ChatGPT was assessed based on its ability to answer a set of 120 questions, where it achieved a correct response rate of 67.1%, accurately responding to 80 questions.

Conclusions: ChatGPT may be used as an assistance tool in medical education. While ChatGPT can serve as a valuable tool in medical education, it cannot fully replace human expertise and knowledge due to its inherent limitations.

Keywords: AI in medicine; ChatGPT; artificial intelligence; health IT; innovations; language processing; medical education; virtual teaching assistant.