Self-supervised learning with self-distillation on COVID-19 medical image classification

Zhiyong Tan; Yuhai Yu; Jiana Meng; Shuang Liu; Wei Li

doi:10.1016/j.cmpb.2023.107876

Self-supervised learning with self-distillation on COVID-19 medical image classification

Comput Methods Programs Biomed. 2024 Jan:243:107876. doi: 10.1016/j.cmpb.2023.107876. Epub 2023 Oct 18.

Authors

Zhiyong Tan¹, Yuhai Yu¹, Jiana Meng², Shuang Liu¹, Wei Li¹

Affiliations

¹ School of Computer Science and Engineering, Dalian Minzu University, Dalian, Liaoning 116600, China.
² School of Computer Science and Engineering, Dalian Minzu University, Dalian, Liaoning 116600, China. Electronic address: mengjn@dlnu.edu.cn.

PMID: 37875036
DOI: 10.1016/j.cmpb.2023.107876

Abstract

Background and objective: Currently, COVID-19 is a highly infectious disease that can be clinically diagnosed based on diagnostic radiology. Deep learning is capable of mining the rich information implied in inpatient imaging data and accomplishing the classification of different stages of the disease process. However, a large amount of training data is essential to train an excellent deep-learning model. Unfortunately, due to factors such as privacy and labeling difficulties, annotated data for COVID-19 is extremely scarce, which encourages us to propose a more effective deep learning model that can effectively assist specialist physicians in COVID-19 diagnosis.

Methods: In this study,we introduce Masked Autoencoder (MAE) for pre-training and fine-tuning directly on small-scale target datasets. Based on this, we propose Self-Supervised Learning with Self-Distillation on COVID-19 medical image classification (SSSD-COVID). In addition to the reconstruction loss computation on the masked image patches, SSSD-COVID further performs self-distillation loss calculations on the latent representation of the encoder and decoder outputs. The additional loss calculation can transfer the knowledge from the global attention of the decoder to the encoder which acquires only local attention.

Results: Our model achieves 97.78 % recognition accuracy on the SARS-COV-CT dataset containing 2481 images and is further validated on the COVID-CT dataset containing 746 images, which achieves 81.76 % recognition accuracy. Further introduction of external knowledge resulted in experimental accuracies of 99.6% and 95.27 % on these two datasets, respectively.

Conclusions: SSSD-COVID can obtain good results on the target dataset alone, and when external information is introduced, the performance of the model can be further improved to significantly outperform other models.Overall, the experimental results show that our method can effectively mine COVID-19 features from rare data and can assist professional physicians in decision-making to improve the efficiency of COVID-19 disease detection.

Keywords: COVID-19; Chest CT; Masked autoencoder; Self-distillation; Self-supervised learning.

MeSH terms

COVID-19 Testing
COVID-19* / diagnostic imaging
Humans
Inpatients
Physicians*
Supervised Machine Learning