ChatGPT Performs on the Chinese National Medical Licensing Examination

Xinyi Wang; Zhenye Gong; Guoxin Wang; Jingdan Jia; Ying Xu; Jialu Zhao; Qingye Fan; Shaun Wu; Weiguo Hu; Xiaoyang Li

doi:10.1007/s10916-023-01961-0

ChatGPT Performs on the Chinese National Medical Licensing Examination

J Med Syst. 2023 Aug 15;47(1):86. doi: 10.1007/s10916-023-01961-0.

Authors

Xinyi Wang^#¹, Zhenye Gong^#¹, Guoxin Wang¹, Jingdan Jia¹, Ying Xu¹, Jialu Zhao¹, Qingye Fan¹, Shaun Wu², Weiguo Hu¹, Xiaoyang Li³

Affiliations

¹ Department of Medical Education, Ruijin Hospital Affifiliated to Shanghai Jiao Tong University School of Medicine, 197 Ruijin Rd. II, Shanghai, 200025, China.
² WORK Medical Technology Group LTD, Hangzhou, China.
³ Department of Medical Education, Ruijin Hospital Affifiliated to Shanghai Jiao Tong University School of Medicine, 197 Ruijin Rd. II, Shanghai, 200025, China. woodslee429@126.com.

^# Contributed equally.

PMID: 37581690
DOI: 10.1007/s10916-023-01961-0

Abstract

ChatGPT, a language model developed by OpenAI, uses a 175 billion parameter Transformer architecture for natural language processing tasks. This study aimed to compare the knowledge and interpretation ability of ChatGPT with those of medical students in China by administering the Chinese National Medical Licensing Examination (NMLE) to both ChatGPT and medical students. We evaluated the performance of ChatGPT in three years' worth of the NMLE, which consists of four units. At the same time, the exam results were compared to those of medical students who had studied for five years at medical colleges. ChatGPT's performance was lower than that of the medical students, and ChatGPT's correct answer rate was related to the year in which the exam questions were released. ChatGPT's knowledge and interpretation ability for the NMLE were not yet comparable to those of medical students in China. It is probable that these abilities will improve through deep learning.

Keywords: ChatGPT; Chinese National Medical Licensing Examination; Medical student.

Publication types

Comparative Study

MeSH terms

Artificial Intelligence*
Asian People
China
Educational Measurement* / standards
Humans
Knowledge
Language
Licensure* / standards
Medicine* / standards
Students, Medical* / statistics & numerical data