An Attention Model With Transfer Embeddings to Classify Pneumonia-Related Bilingual Imaging Reports: Algorithm Development and Validation

Hyung Park; Min Song; Eun Byul Lee; Bo Kyung Seo; Chang Min Choi

doi:10.2196/24803

JMIR Med Inform. 2021 May 17;9(5):e24803. doi: 10.2196/24803.

Authors

Hyung Park^#¹, Min Song^#², Eun Byul Lee², Bo Kyung Seo², Chang Min Choi¹,³

Affiliations

¹ Department of Pulmonary and Critical Care Medicine, Asan Medical Center, Seoul, Republic of Korea.
² Yonsei University, Seoul, Republic of Korea.
³ Department of Oncology, Asan Medical Center, Seoul, Republic of Korea.

^# Contributed equally.

PMID: 33820755
PMCID: PMC8167619
DOI: 10.2196/24803

Abstract

Background: In the analysis of electronic health records, accurate labeling of outcomes is essential. To extract this information from radiologic reports, several studies have classified the reports using deep learning. However, the classification of pneumonia in bilingual radiologic reports has not been reported previously.

Objective: The aim of this research was to classify radiologic reports into pneumonia or no pneumonia using a deep learning method.

Methods: A data set of radiology reports for chest computed tomography and chest x-rays of surgical patients at the Asan Medical Center in Korea, from January 2008 to January 2018, was retrospectively analyzed. The classification performance of our long short-term memory (LSTM)-Attention model was compared with that of various deep learning and machine learning methods. The area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve, sensitivity, specificity, accuracy, and F1 score were compared across the models.
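As a rough illustration of how the metrics named above are typically computed for a multiclass report classifier, the sketch below evaluates each class one-vs-rest. It is not the authors' evaluation code: the function names, the label strings, and the toy data are hypothetical and chosen only for this example, and the AUROC uses the standard pairwise (Mann-Whitney) formulation.

```python
# Minimal sketch (assumed, not from the paper): one-vs-rest evaluation
# metrics for a three-class report classifier, standard library only.

def accuracy(y_true, y_pred):
    """Fraction of reports whose predicted label equals the reference label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def one_vs_rest_metrics(y_true, y_pred, target):
    """Sensitivity, specificity, precision, and F1 for one class vs. the rest."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if t == target and p == target:
            tp += 1
        elif t != target and p == target:
            fp += 1
        elif t == target and p != target:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


def auroc(scores, is_target):
    """AUROC as the probability that a randomly chosen target-class report
    gets a higher score than a randomly chosen other report (ties count 0.5)."""
    pos = [s for s, y in zip(scores, is_target) if y]
    neg = [s for s, y in zip(scores, is_target) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Invented toy labels, for illustration only.
toy_true = ["negative", "negative", "positive", "obscure", "positive", "negative"]
toy_pred = ["negative", "negative", "positive", "positive", "obscure", "negative"]

if __name__ == "__main__":
    print("accuracy:", accuracy(toy_true, toy_pred))
    for label in ("negative", "positive", "obscure"):
        print(label, one_vs_rest_metrics(toy_true, toy_pred, label))
```

The quadratic pairwise AUROC is fine for a few thousand reports; a rank-based implementation would be preferable for larger sets.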

Results: A total of 5450 radiologic reports containing at least one pneumonia-related word were included. In the test set (n=1090), our proposed model showed 91.01% (992/1090) accuracy (AUROCs for negative, positive, and obscure were 0.98, 0.97, and 0.90, respectively). The 3 best-performing models were based on FastText or LSTM. The convolutional neural network-based model showed a lower accuracy, 73.03% (796/1090), than the other 2 algorithms. The classification of negative results had an F1 score of 0.96, whereas the classification of positive and uncertain results showed a lower performance (positive F1 score 0.83; uncertain F1 score 0.62). In the extra-validation set, our model showed 80.0% (642/803) accuracy (AUROCs for negative, positive, and obscure were 0.92, 0.96, and 0.84, respectively).
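The reported accuracies can be reproduced from the counts given in parentheses. The short check below uses only figures stated in the Results; the variable and function names are hypothetical. It also shows that the test set of 1090 reports is one-fifth of the 5450 included reports, which is an arithmetic observation and not a statement about how the authors describe their split.

```python
# Minimal arithmetic check of the accuracies reported in the Results.
# Counts come from the abstract; names are illustrative only.

reported = [
    # (description, correct, total, reported percent, decimals reported)
    ("proposed model, test set", 992, 1090, 91.01, 2),
    ("CNN-based model, test set", 796, 1090, 73.03, 2),
    ("proposed model, extra-validation set", 642, 803, 80.0, 1),
]


def percent(correct, total):
    """Accuracy as a percentage."""
    return 100 * correct / total


def matches(correct, total, reported_pct, decimals):
    """True if the recomputed percentage rounds to the reported value."""
    return round(percent(correct, total), decimals) == reported_pct


# Share of all included reports that fall in the test set.
test_fraction = 1090 / 5450

if __name__ == "__main__":
    for name, correct, total, pct, decimals in reported:
        print(name, round(percent(correct, total), decimals),
              matches(correct, total, pct, decimals))
    print("test set share of included reports:", test_fraction)
```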

Conclusions: Our method showed excellent performance in classifying pneumonia in bilingual radiologic reports. The method could enrich the research on pneumonia by obtaining exact outcomes from electronic health data.

Keywords: attention; classification; clinical data; deep learning; electronic health record; machine learning; medical imaging; model; natural language process; pneumonia.

©Hyung Park, Min Song, Eun Byul Lee, Bo Kyung Seo, Chang Min Choi. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 17.05.2021.