Fully Self-Supervised Out-of-Domain Few-Shot Learning with Masked Autoencoders

Reece Walsh; Islam Osman; Omar Abdelaziz; Mohamed S Shehata

doi:10.3390/jimaging10010023

Fully Self-Supervised Out-of-Domain Few-Shot Learning with Masked Autoencoders

J Imaging. 2024 Jan 16;10(1):23. doi: 10.3390/jimaging10010023.

Authors

Reece Walsh¹, Islam Osman¹, Omar Abdelaziz¹, Mohamed S Shehata¹

Affiliation

¹ Irving K. Barber Faculty of Science, University of British Columbia, Kelowna, BC V1V 1V7, Canada.

PMID: 38249008
DOI: 10.3390/jimaging10010023

Abstract

Few-shot learning aims to identify unseen classes with limited labelled data. Recent few-shot learning techniques have shown success in generalizing to unseen classes; however, the performance of these techniques has also been shown to degrade when tested on an out-of-domain setting. Previous work, additionally, has also demonstrated increasing reliance on supervised finetuning in an off-line or online capacity. This paper proposes a novel, fully self-supervised few-shot learning technique (FSS) that utilizes a vision transformer and masked autoencoder. The proposed technique can generalize to out-of-domain classes by finetuning the model in a fully self-supervised method for each episode. We evaluate the proposed technique using three datasets (all out-of-domain). As such, our results show that FSS has an accuracy gain of 1.05%, 0.12%, and 1.28% on the ISIC, EuroSat, and BCCD datasets, respectively, without the use of supervised training.

Keywords: few-shot learning; image classification; out-of-domain; self-supervised.