Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning

Tasfia Shermin; Shyh Wei Teng; Ferdous Sohel; Manzur Murshed; Guojun Lu

doi:10.1109/TIP.2021.3135480

Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning

IEEE Trans Image Process. 2022:31:721-733. doi: 10.1109/TIP.2021.3135480. Epub 2021 Dec 28.

Authors

Tasfia Shermin, Shyh Wei Teng, Ferdous Sohel, Manzur Murshed, Guojun Lu

PMID: 34928799
DOI: 10.1109/TIP.2021.3135480

Abstract

Bidirectional mapping-based generalized zero-shot learning (GZSL) methods rely on the quality of synthesized features to recognize seen and unseen data. Therefore, learning a joint distribution of seen-unseen classes and preserving the distinction between seen-unseen classes is crucial for GZSL methods. However, existing methods only learn the underlying distribution of seen data, although unseen class semantics are available in the GZSL problem setting. Most methods neglect retaining seen-unseen classes distinction and use the learned distribution to recognize seen and unseen data. Consequently, they do not perform well. In this work, we utilize the available unseen class semantics alongside seen class semantics and learn joint distribution through a strong visual-semantic coupling. We propose a bidirectional mapping coupled generative adversarial network (BMCoGAN) by extending the concept of the coupled generative adversarial network into a bidirectional mapping model. We further integrate a Wasserstein generative adversarial optimization to supervise the joint distribution learning. We design a loss optimization for retaining distinctive information of seen-unseen classes in the synthesized features and reducing bias towards seen classes, which pushes synthesized seen features towards real seen features and pulls synthesized unseen features away from real seen features. We evaluate BMCoGAN on benchmark datasets and demonstrate its superior performance against contemporary methods.