Label-activating framework for zero-shot learning

Yang Liu; Xinbo Gao; Quanxue Gao; Jungong Han; Ling Shao

doi:10.1016/j.neunet.2019.08.023

Label-activating framework for zero-shot learning

Neural Netw. 2020 Jan:121:1-9. doi: 10.1016/j.neunet.2019.08.023. Epub 2019 Sep 6.

Authors

Yang Liu¹, Xinbo Gao², Quanxue Gao³, Jungong Han⁴, Ling Shao⁵

Affiliations

¹ State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China.
² State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China; School of Electronic Engineering, Xidian University, Shaanxi 710071, China.
³ State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China. Electronic address: qxgao@xidian.edu.cn.
⁴ WMG Data Science, University of Warwick, CV4 7AL Coventry, United Kingdom.
⁵ Inception Institute of Artificial Intelligence, Abu Dhabi, United Arab Emirates.

PMID: 31522020
DOI: 10.1016/j.neunet.2019.08.023

Abstract

Existing zero-shot learning (ZSL) models usually learn mappings between visual space and semantic space. However, few of them take the label information into account. Indirect Attribute Prediction (IAP) learns the posterior probability of each attribute by label space, but labels of seen and unseen classes are defined in different spaces, which is not suitable for Generalized ZSL (GZSL). We propose a Label-Activating Framework (LAF) for semantic-based classification. The purpose of the proposed framework is to activate the label space by learning mappings from vision and semantics to labels. In the training phase, the original label space made up of one-hot vectors is used as common space, on which visual features and semantic information are embedded. After the label space is activated, labels of unseen classes can be regarded as the linear combination of labels of seen classes. In this case, seen and unseen labels are defined in the same space, and the label space has specific meanings rather than only signs of each class. Doing so makes the activated label space become very discriminative, especially for GZSL, which is therefore more challenging and reasonable for real-world tasks. In addition, we develop a specific model based on the framework, which effectively mitigate the projection domain shift problem. Extensive experiments show our framework outperforms state-of-the-art methods and also its suitability for GZSL.

Keywords: Discriminative; Label space; Semantic space; Zero-shot learning.

MeSH terms

Animals
Humans
Machine Learning* / trends
Semantics*