Computational model of stereoscopic 3D visual saliency

Junle Wang; Matthieu Perreira Da Silva; Patrick Le Callet; Vincent Ricordel

doi:10.1109/TIP.2013.2246176

Computational model of stereoscopic 3D visual saliency

IEEE Trans Image Process. 2013 Jun;22(6):2151-65. doi: 10.1109/TIP.2013.2246176. Epub 2013 Feb 11.

Authors

Junle Wang¹, Matthieu Perreira Da Silva, Patrick Le Callet, Vincent Ricordel

Affiliation

¹ LUNAM Université, Université de Nantes, Institut de Recherche en Communications et Cybernétique de Nantes, Polytech Nantes, Nantes 44306, France. wang.junle@gmail.com

PMID: 23412612
DOI: 10.1109/TIP.2013.2246176

Abstract

Many computational models of visual attention performing well in predicting salient areas of 2D images have been proposed in the literature. The emerging applications of stereoscopic 3D display bring an additional depth of information affecting the human viewing behavior, and require extensions of the efforts made in 2D visual modeling. In this paper, we propose a new computational model of visual attention for stereoscopic 3D still images. Apart from detecting salient areas based on 2D visual features, the proposed model takes depth as an additional visual dimension. The measure of depth saliency is derived from the eye movement data obtained from an eye-tracking experiment using synthetic stimuli. Two different ways of integrating depth information in the modeling of 3D visual attention are then proposed and examined. For the performance evaluation of 3D visual attention models, we have created an eye-tracking database, which contains stereoscopic images of natural content and is publicly available, along with this paper. The proposed model gives a good performance, compared to that of state-of-the-art 2D models on 2D images. The results also suggest that a better performance is obtained when depth information is taken into account through the creation of a depth saliency map, rather than when it is integrated by a weighting method.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Attention
Computer Simulation
Databases, Factual
Depth Perception / physiology*
Eye Movements / physiology*
Humans
Image Processing, Computer-Assisted / methods*
Middle Aged
Models, Biological*
Photic Stimulation