Multiview Semantic Representation for Visual Recognition

Chunjie Zhang; Jian Cheng; Qi Tian

doi:10.1109/TCYB.2018.2875728

Multiview Semantic Representation for Visual Recognition

IEEE Trans Cybern. 2020 May;50(5):2038-2049. doi: 10.1109/TCYB.2018.2875728. Epub 2018 Nov 6.

Authors

Chunjie Zhang, Jian Cheng, Qi Tian

PMID: 30418893
DOI: 10.1109/TCYB.2018.2875728

Abstract

Due to interclass and intraclass variations, the images of different classes are often cluttered which makes it hard for efficient classifications. The use of discriminative classification algorithms helps to alleviate this problem. However, it is still an open problem to accurately model the relationships between visual representations and human perception. To alleviate these problems, in this paper, we propose a novel multiview semantic representation (MVSR) algorithm for efficient visual recognition. First, we leverage visually based methods to get initial image representations. We then use both visual and semantic similarities to divide images into groups which are then used for semantic representations. We treat different image representation strategies, partition methods, and numbers as different views. A graph is then used to combine the discriminative power of different views. The similarities between images can be obtained by measuring the similarities of graphs. Finally, we train classifiers to predict the categories of images. We evaluate the discriminative power of the proposed MVSR method for visual recognition on several public image datasets. Experimental results show the effectiveness of the proposed method.

MeSH terms

Algorithms
Deep Learning
Humans
Image Processing, Computer-Assisted / methods*
Pattern Recognition, Automated / methods*
Semantics*