A difficulty-aware and task-augmentation method based on meta-learning model for few-shot diabetic retinopathy classification

Xueyao Liu; Xueyuan Dong; Tuo Li; Xiaofeng Zou; Chen Cheng; Zekun Jiang; Zhumin Gao; Sixu Duan; Meirong Chen; Tingting Liu; Pu Huang; Dengwang Li; Hua Lu

doi:10.21037/qims-23-567

Quant Imaging Med Surg. 2024 Jan 3;14(1):861-876. doi: 10.21037/qims-23-567. Epub 2024 Jan 2.

Authors

Xueyao Liu^#¹, Xueyuan Dong^#¹, Tuo Li², Xiaofeng Zou², Chen Cheng¹, Zekun Jiang¹, Zhumin Gao¹, Sixu Duan¹, Meirong Chen³, Tingting Liu⁴, Pu Huang¹, Dengwang Li¹, Hua Lu¹

Affiliations

¹ Shandong Key Laboratory of Medical Physics and Image Processing, Shandong Institute of Industrial Technology for Health Sciences and Precision Medicine, School of Physics and Electronics, Shandong Normal University, Jinan, China.
² Shandong Yunhai Guochuang Cloud Computing Equipment Industry Innovation Co., Ltd., Jinan, China.
³ Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, China.
⁴ Eye Hospital of Shandong First Medical University (Shandong Eye Hospital), Jinan, China.

^# Contributed equally.

Abstract

Background: Accurate classification techniques are essential for the early diagnosis and treatment of patients with diabetic retinopathy (DR). However, the limited amount of annotated DR data poses a challenge for existing deep-learning models. This article proposes a difficulty-aware and task-augmentation method based on meta-learning (DaTa-ML) model for few-shot DR classification with fundus images.

Methods: The difficulty-aware (Da) method dynamically modifies the cross-entropy loss function applied to the learning tasks. It automatically down-weights easier tasks and prioritizes more challenging ones, with the aim of optimizing the learning process. The task-augmentation (Ta) method enhances the meta-training process by increasing the number of tasks through image rotation, which also improves the feature-extraction capability. With the set of meta-training tasks expanded in this way, a wider variety of task instances can be sampled during the meta-training stage. The Ta method was introduced to optimize the initialization parameters and enhance the meta-generalization performance of the model. The DaTa-ML model showed promising results in addressing the challenges associated with few-shot DR classification.
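The two ideas described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the Da weighting formula, so the rule below (task weight proportional to the task loss raised to a power `gamma`) is an assumption chosen only to show easier tasks being down-weighted. The function names, the use of 90-degree rotation steps, and the choice to keep labels unchanged under rotation are likewise hypothetical.

```python
import math

def cross_entropy(probs, label):
    """Cross-entropy of one sample, given predicted class probabilities."""
    return -math.log(max(probs[label], 1e-12))

def task_loss(predictions):
    """Mean cross-entropy over the (probs, label) pairs of one task."""
    return sum(cross_entropy(p, y) for p, y in predictions) / len(predictions)

def difficulty_weights(task_losses, gamma=2.0):
    """ASSUMED rule: weight_i is proportional to loss_i ** gamma (sums to 1)."""
    raw = [loss ** gamma for loss in task_losses]
    total = sum(raw)
    if total == 0.0:
        # All tasks are solved perfectly: fall back to uniform weights.
        return [1.0 / len(raw)] * len(raw)
    return [r / total for r in raw]

def rotate90(image):
    """Rotate a 2D image (list of rows) by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]

def augment_tasks(tasks):
    """Return each task followed by its 90-, 180-, and 270-degree copies.

    A task is a list of (image, label) pairs; labels are kept unchanged
    here, which is an assumption of this sketch.
    """
    augmented = []
    for task in tasks:
        current = task
        augmented.append(current)
        for _ in range(3):
            current = [(rotate90(img), label) for img, label in current]
            augmented.append(current)
    return augmented

# Toy usage: two tasks with made-up predictions for a 2-class problem.
easy = [([0.9, 0.1], 0), ([0.2, 0.8], 1)]
hard = [([0.4, 0.6], 0), ([0.7, 0.3], 1)]
losses = [task_loss(easy), task_loss(hard)]
weights = difficulty_weights(losses)
# Difficulty-weighted meta-loss: the harder task dominates the sum.
meta_loss = sum(w * l for w, l in zip(weights, losses))
```

In this sketch a larger `gamma` shifts more of the total weight onto high-loss tasks, and `augment_tasks` turns every original task into four, which mirrors the idea of enlarging the pool of meta-training tasks by rotation.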

Results: The Asia Pacific Tele-Ophthalmology Society (APTOS) 2019 blindness detection data set was used to evaluate the DaTa-ML model. With only 1% of the training data (5-way, 20-shot) and a single update step (training time reduced by 90%), the DaTa-ML model achieved an accuracy of 89.6% on the test data. This is a 1.7% improvement over the transfer-learning method [i.e., a residual neural network (ResNet50) pre-trained on ImageNet] and a 16.8% improvement over models trained from scratch (i.e., ResNet50 without pre-trained weights), even though the DaTa-ML model has far fewer trainable parameters (only 0.47% of the ResNet50 parameters).
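For readers unfamiliar with the "5-way, 20-shot" notation, the following Python sketch shows how one such few-shot task (episode) could be sampled: 5 classes are drawn, and 20 labelled support images are taken per class, giving 100 support images in total. This is a generic illustration, not the authors' sampling code; the function name `sample_episode`, the number of query images per class, and the file names are hypothetical.

```python
import random

def sample_episode(items_by_class, n_way, k_shot, n_query, rng):
    """Sample one N-way K-shot episode.

    items_by_class maps a class name to a list of items (e.g. image paths).
    Returns (support, query) lists of (item, episode_label) pairs, where
    episode_label is the class index within this episode (0 .. n_way - 1).
    """
    classes = rng.sample(sorted(items_by_class), n_way)
    support, query = [], []
    for episode_label, cls in enumerate(classes):
        # Draw support and query items together so that they never overlap.
        picked = rng.sample(items_by_class[cls], k_shot + n_query)
        support += [(item, episode_label) for item in picked[:k_shot]]
        query += [(item, episode_label) for item in picked[k_shot:]]
    return support, query

# Toy usage: five severity grades with 30 placeholder file names each.
toy_data = {
    f"grade_{g}": [f"grade_{g}_img_{i}.png" for i in range(30)]
    for g in range(5)
}
support, query = sample_episode(
    toy_data, n_way=5, k_shot=20, n_query=5, rng=random.Random(0)
)
```

Here the support set is what the model adapts on (in the setting above, with a single update step), and the query set is used to measure how well the adapted model classifies unseen images from the same classes.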

Conclusions: The DaTa-ML model provides a more efficient DR classification solution when little annotated data is available and has significant advantages over state-of-the-art methods. Thus, it could be used to guide and assist ophthalmologists in determining the severity of DR.

Keywords: Diabetic retinopathy classification (DR classification); difficulty-aware (Da); few-shot; meta-learning; task-augmentation (Ta).