Multilabel image annotation based on double-layer PLSA model

Jing Zhang; Da Li; Weiwei Hu; Zhihua Chen; Yubo Yuan

doi:10.1155/2014/494387

Multilabel image annotation based on double-layer PLSA model

ScientificWorldJournal. 2014:2014:494387. doi: 10.1155/2014/494387. Epub 2014 Jun 4.

Authors

Jing Zhang¹, Da Li², Weiwei Hu², Zhihua Chen², Yubo Yuan²

Affiliations

¹ School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China ; State Key Lab. for Novel Software Technology, Nanjing University, Nanjing, China.
² School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China.

Abstract

Due to the semantic gap between visual features and semantic concepts, automatic image annotation has become a difficult issue in computer vision recently. We propose a new image multilabel annotation method based on double-layer probabilistic latent semantic analysis (PLSA) in this paper. The new double-layer PLSA model is constructed to bridge the low-level visual features and high-level semantic concepts of images for effective image understanding. The low-level features of images are represented as visual words by Bag-of-Words model; latent semantic topics are obtained by the first layer PLSA from two aspects of visual and texture, respectively. Furthermore, we adopt the second layer PLSA to fuse the visual and texture latent semantic topics and achieve a top-layer latent semantic topic. By the double-layer PLSA, the relationships between visual features and semantic concepts of images are established, and we can predict the labels of new images by their low-level features. Experimental results demonstrate that our automatic image annotation model based on double-layer PLSA can achieve promising performance for labeling and outperform previous methods on standard Corel dataset.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence
Image Interpretation, Computer-Assisted / methods*
Models, Statistical
Models, Theoretical
Pattern Recognition, Automated