Zero-shot learning via visual-semantic aligned autoencoder

Tianshu Wei; Jinjie Huang; Cong Jin

doi:10.3934/mbe.2023629

Math Biosci Eng. 2023 Jun 25;20(8):14081-14095. doi: 10.3934/mbe.2023629.

Authors

Tianshu Wei¹, Jinjie Huang¹ ², Cong Jin¹

Affiliations

¹ School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150006, China.
² School of Automation, Harbin University of Science and Technology, Harbin 150006, China.

PMID: 37679126
DOI: 10.3934/mbe.2023629

Abstract

Zero-shot learning recognizes samples from unseen classes using a model learned from seen-class samples and semantic features. Because the training set contains no information about unseen-class samples, some researchers have proposed generating unseen-class samples with generative models. However, the generative model is first trained on the training-set samples and only then used to generate unseen-class samples. As a result, the features of the generated unseen-class samples tend to be biased toward the seen classes and may deviate substantially from real unseen-class samples. To tackle this problem, we use an autoencoder to generate the unseen-class samples and construct the loss function by combining the semantic features of the unseen classes with the features of the newly generated samples. The proposed method was validated on three datasets and showed good results.
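The abstract does not specify the network architecture or the exact form of the loss, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a linear autoencoder with tied weights (the encoder `W` maps visual features to semantic space and its transpose decodes semantic vectors back to visual space), squared-error terms, and a weighting factor `lam`; the function name `aligned_autoencoder_loss` and all parameters are hypothetical. The seen-class term ties visual features to their class semantics, while the unseen-class term decodes each unseen semantic vector into a generated visual feature and requires that feature to encode back to the same semantic vector.

```python
def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def transpose(W):
    """Return the transpose of matrix W."""
    return [list(col) for col in zip(*W)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def aligned_autoencoder_loss(W, seen_visual, seen_semantic, unseen_semantic, lam=1.0):
    """Illustrative loss for a tied-weight linear autoencoder (an assumption).

    W               : encoder matrix, shape (semantic_dim, visual_dim)
    seen_visual     : visual feature vectors of seen-class samples
    seen_semantic   : semantic vector of each seen sample's class
    unseen_semantic : semantic vectors of the unseen classes
    lam             : hypothetical weight of the unseen-class term
    """
    Wt = transpose(W)  # tied decoder: semantic space -> visual space

    # Seen classes: the encoding should match the class semantics,
    # and decoding that encoding should reconstruct the visual feature.
    seen_term = 0.0
    for x, s in zip(seen_visual, seen_semantic):
        code = matvec(W, x)
        seen_term += sq_dist(code, s) + sq_dist(matvec(Wt, code), x)

    # Unseen classes: generate a visual feature from the semantic vector,
    # then require it to encode back to that same semantic vector.
    unseen_term = 0.0
    for s in unseen_semantic:
        x_generated = matvec(Wt, s)
        unseen_term += sq_dist(matvec(W, x_generated), s)

    return seen_term / len(seen_visual) + lam * unseen_term / len(unseen_semantic)


# Toy usage with 2-dimensional visual and semantic spaces.
identity = [[1.0, 0.0], [0.0, 1.0]]
loss_aligned = aligned_autoencoder_loss(identity, [[1.0, 0.0]], [[1.0, 0.0]], [[0.0, 1.0]])

# This encoder ignores the second dimension, so the unseen class is not recoverable.
degenerate = [[1.0, 0.0], [0.0, 0.0]]
loss_biased = aligned_autoencoder_loss(degenerate, [[1.0, 0.0]], [[1.0, 0.0]], [[0.0, 1.0]])
```

In this toy setting the degenerate encoder fits the seen class perfectly yet is penalized by the unseen-class term, which illustrates how including unseen-class semantics in the loss can counteract a bias toward seen classes.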

Keywords: autoencoder; conventional zero-shot learning; generalized zero-shot learning; generated samples; modalities alignment.