A difficulty-aware and task-augmentation method based on meta-learning model for few-shot diabetic retinopathy classification

Quant Imaging Med Surg. 2024 Jan 3;14(1):861-876. doi: 10.21037/qims-23-567. Epub 2024 Jan 2.

Abstract

Background: Accurate classification techniques are essential for the early diagnosis and treatment of patients with diabetic retinopathy (DR). However, the limited amount of annotated DR data poses a challenge for existing deep-learning models. This article proposes a difficulty-aware and task-augmentation method based on meta-learning (DaTa-ML) model for few-shot DR classification with fundus images.

Methods: The difficulty-aware (Da) method operates by dynamically modifying the cross-entropy loss function applied to learning tasks. This methodology has the ability to intelligently down-weight simpler tasks, while simultaneously prioritizing more challenging tasks. These adjustments occur automatically and aim to optimize the learning process. Additionally, the task-augmentation (Ta) method is used to enhance the meta-training process by augmenting the number of tasks through image rotation and improving the feature-extraction capability. To implement the expansion of the meta-training tasks, various task instances can be sampled during the meta-training stage. Ultimately, the proposed Ta method was introduced to optimize the initialization parameters and enhance the meta-generalization performance of the model. The DaTa-ML model showed promising results by effectively addressing the challenges associated with few-shot DR classification.

Results: The Asia Pacific Tele-Ophthalmology Society (APTOS) 2019 blindness detection data set was used to evaluate the DaTa-ML model. The results showed that with only 1% of the training data (5-way, 20-shot) and a single update step (training time reduced by 90%), the DaTa-ML model had an accuracy rate of 89.6% on the test data, which is a 1.7% improvement over the transfer-learning method [i.e., residual neural network (ResNet)50 pre-trained on ImageNet], and a 16.8% improvement over scratch-built models (i.e., ResNet50 without pre-trained weights), despite having fewer trainable parameters (the parameters used by the DaTa-ML model are only 0.47% of the ResNet50 parameters).

Conclusions: The DaTa-ML model provides a more efficient DR classification solution with little annotated data and has significant advantages over state-of-the-art methods. Thus, it could be used to guide and assist ophthalmologists to determine the severity of DR.

Keywords: Diabetic retinopathy classification (DR classification); difficulty-aware (Da); few-shot; meta-learning; task-augmentation (Ta).