3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?

Ran Song; Wei Zhang; Yitian Zhao; Yonghuai Liu; Paul L Rosin

doi:10.1109/TPAMI.2023.3287356

3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13083-13099. doi: 10.1109/TPAMI.2023.3287356. Epub 2023 Oct 3.

Authors

Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu, Paul L Rosin

PMID: 37335789
DOI: 10.1109/TPAMI.2023.3287356

Abstract

While 3D visual saliency aims to predict regional importance of 3D surfaces in agreement with human visual perception and has been well researched in computer vision and graphics, latest work with eye-tracking experiments shows that state-of-the-art 3D visual saliency methods remain poor at predicting human fixations. Cues emerging prominently from these experiments suggest that 3D visual saliency might associate with 2D image saliency. This paper proposes a framework that combines a Generative Adversarial Network and a Conditional Random Field for learning visual saliency of both a single 3D object and a scene composed of multiple 3D objects with image saliency ground truth to 1) investigate whether 3D visual saliency is an independent perceptual measure or just a derivative of image saliency and 2) provide a weakly supervised method for more accurately predicting 3D visual saliency. Through extensive experiments, we not only demonstrate that our method significantly outperforms the state-of-the-art approaches, but also manage to answer the interesting and worthy question proposed within the title of this paper.