Zero-shot learning via visual-semantic aligned autoencoder

Math Biosci Eng. 2023 Jun 25;20(8):14081-14095. doi: 10.3934/mbe.2023629.

Abstract

Zero-shot learning recognizes the unseen samples via the model learned from the seen class samples and semantic features. Due to the lack of information of unseen class samples in the training set, some researchers have proposed the method of generating unseen class samples by using generative models. However, the generated model is trained with the training set samples first, and then the unseen class samples are generated, which results in the features of the unseen class samples tending to be biased toward the seen class and may produce large deviations from the real unseen class samples. To tackle this problem, we use the autoencoder method to generate the unseen class samples and combine the semantic features of the unseen classes with the proposed new sample features to construct the loss function. The proposed method is validated on three datasets and showed good results.

Keywords: autoencoder; conventional zero-shot learning; generalized zero-shot learning; generated samples; modalities alignment.