Deep semi-supervised multiple instance learning with self-correction for DME classification from OCT images

Xi Wang; Fangyao Tang; Hao Chen; Carol Y Cheung; Pheng-Ann Heng

doi:10.1016/j.media.2022.102673

Deep semi-supervised multiple instance learning with self-correction for DME classification from OCT images

Med Image Anal. 2023 Jan:83:102673. doi: 10.1016/j.media.2022.102673. Epub 2022 Oct 26.

Authors

Xi Wang¹, Fangyao Tang², Hao Chen³, Carol Y Cheung², Pheng-Ann Heng⁴

Affiliations

¹ Zhejiang Lab, Hangzhou, China; Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China; Department of Radiation Oncology, Stanford University School of Medicine, Palo Alto, CA, USA.
² Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong, China.
³ Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong, China. Electronic address: jhc@cse.ust.hk.
⁴ Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China; Guangdong-Hong Kong-Macao Joint Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China.

PMID: 36403310
DOI: 10.1016/j.media.2022.102673

Abstract

Supervised deep learning has achieved prominent success in various diabetic macular edema (DME) recognition tasks from optical coherence tomography (OCT) volumetric images. A common problematic issue that frequently occurs in this field is the shortage of labeled data due to the expensive fine-grained annotations, which increases substantial difficulty in accurate analysis by supervised learning. The morphological changes in the retina caused by DME might be distributed sparsely in B-scan images of the OCT volume, and OCT data is often coarsely labeled at the volume level. Hence, the DME identification task can be formulated as a multiple instance classification problem that could be addressed by multiple instance learning (MIL) techniques. Nevertheless, none of previous studies utilize unlabeled data simultaneously to promote the classification accuracy, which is particularly significant for a high quality of analysis at the minimum annotation cost. To this end, we present a novel deep semi-supervised multiple instance learning framework to explore the feasibility of leveraging a small amount of coarsely labeled data and a large amount of unlabeled data to tackle this problem. Specifically, we come up with several modules to further improve the performance according to the availability and granularity of their labels. To warm up the training, we propagate the bag labels to the corresponding instances as the supervision of training, and propose a self-correction strategy to handle the label noise in the positive bags. This strategy is based on confidence-based pseudo-labeling with consistency regularization. The model uses its prediction to generate the pseudo-label for each weakly augmented input only if it is highly confident about the prediction, which is subsequently used to supervise the same input in a strongly augmented version. This learning scheme is also applicable to unlabeled data. To enhance the discrimination capability of the model, we introduce the Student-Teacher architecture and impose consistency constraints between two models. For demonstration, the proposed approach was evaluated on two large-scale DME OCT image datasets. Extensive results indicate that the proposed method improves DME classification with the incorporation of unlabeled data and outperforms competing MIL methods significantly, which confirm the feasibility of deep semi-supervised multiple instance learning at a low annotation cost.

Keywords: Classification; Multiple instance learning; OCT; Semi-supervised learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Diabetic Retinopathy* / diagnostic imaging
Humans
Macular Edema* / diagnostic imaging
Retina / diagnostic imaging
Supervised Machine Learning
Tomography, Optical Coherence