Strengthening deep-learning models for intracranial hemorrhage detection: strongly annotated computed tomography images and model ensembles

Dong-Wan Kang; Gi-Hun Park; Wi-Sun Ryu; Dawid Schellingerhout; Museong Kim; Yong Soo Kim; Chan-Young Park; Keon-Joo Lee; Moon-Ku Han; Han-Gil Jeong; Dong-Eog Kim

doi:10.3389/fneur.2023.1321964

Strengthening deep-learning models for intracranial hemorrhage detection: strongly annotated computed tomography images and model ensembles

Front Neurol. 2023 Dec 29:14:1321964. doi: 10.3389/fneur.2023.1321964. eCollection 2023.

Authors

Dong-Wan Kang^{1

2

3}, Gi-Hun Park⁴, Wi-Sun Ryu⁴, Dawid Schellingerhout⁵, Museong Kim^{6

7}, Yong Soo Kim⁸, Chan-Young Park⁹, Keon-Joo Lee¹⁰, Moon-Ku Han³, Han-Gil Jeong^{3

6}, Dong-Eog Kim^{11

12}

Affiliations

¹ Department of Public Health, Seoul National University Bundang Hospital, Seongnam, Republic of Korea.
² Department of Neurology, Gyeonggi Provincial Medical Center, Icheon Hospital, Icheon, Republic of Korea.
³ Department of Neurology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea.
⁴ JLK Inc., Artificial Intelligence Research Center, Seoul, Republic of Korea.
⁵ Department of Neuroradiology and Imaging Physics, The University of Texas M.D. Anderson Cancer Center, Houston, TX, United States.
⁶ Department of Neurosurgery, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea.
⁷ Hospital Medicine Center, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea.
⁸ Department of Neurology, Nowon Eulji Medical Center, Eulji University School of Medicine, Seoul, Republic of Korea.
⁹ Department of Neurology, Chung-Ang University Hospital, Seoul, Republic of Korea.
¹⁰ Department of Neurology, Korea University Guro Hospital, Seoul, Republic of Korea.
¹¹ Department of Neurology, Dongguk University Ilsan Hospital, Goyang, Republic of Korea.
¹² National Priority Research Center for Stroke, Goyang, Republic of Korea.

Abstract

Background and purpose: Multiple attempts at intracranial hemorrhage (ICH) detection using deep-learning techniques have been plagued by clinical failures. We aimed to compare the performance of a deep-learning algorithm for ICH detection trained on strongly and weakly annotated datasets, and to assess whether a weighted ensemble model that integrates separate models trained using datasets with different ICH improves performance.

Methods: We used brain CT scans from the Radiological Society of North America (27,861 CT scans, 3,528 ICHs) and AI-Hub (53,045 CT scans, 7,013 ICHs) for training. DenseNet121, InceptionResNetV2, MobileNetV2, and VGG19 were trained on strongly and weakly annotated datasets and compared using independent external test datasets. We then developed a weighted ensemble model combining separate models trained on all ICH, subdural hemorrhage (SDH), subarachnoid hemorrhage (SAH), and small-lesion ICH cases. The final weighted ensemble model was compared to four well-known deep-learning models. After external testing, six neurologists reviewed 91 ICH cases difficult for AI and humans.

Results: InceptionResNetV2, MobileNetV2, and VGG19 models outperformed when trained on strongly annotated datasets. A weighted ensemble model combining models trained on SDH, SAH, and small-lesion ICH had a higher AUC, compared with a model trained on all ICH cases only. This model outperformed four deep-learning models (AUC [95% C.I.]: Ensemble model, 0.953[0.938-0.965]; InceptionResNetV2, 0.852[0.828-0.873]; DenseNet121, 0.875[0.852-0.895]; VGG19, 0.796[0.770-0.821]; MobileNetV2, 0.650[0.620-0.680]; p < 0.0001). In addition, the case review showed that a better understanding and management of difficult cases may facilitate clinical use of ICH detection algorithms.

Conclusion: We propose a weighted ensemble model for ICH detection, trained on large-scale, strongly annotated CT scans, as no model can capture all aspects of complex tasks.

Keywords: deep-learning algorithm; intracranial hemorrhage (ICH); neuroimaging; strongly annotated dataset; weighted ensemble model.

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by the Brain Disease Overcoming Research Program Grant (NRF-2020M3E5D9079768), the National Priority Research Center Program Grant (NRF-2021R1A6A1A03038865), the Basic Science Research Program Grant (NRF-2020R1A2C3008295), the Grant for Medical Device Development (KMDF_PR_20200901_0098), and the Bioimaging Data Curation Center Program Grant (NRF-2022M3H9A2083956) of the National Research Foundation, funded by the Korean government.