Salient region detection through salient and non-salient dictionaries

PLoS One. 2019 Mar 28;14(3):e0213433. doi: 10.1371/journal.pone.0213433. eCollection 2019.

Abstract

Low-rank representation-based frameworks are becoming popular for the saliency and the object detection because of their easiness and simplicity. These frameworks only need global features to extract the salient objects while the local features are compromised. To deal with this issue, we regularize the low-rank representation through a local graph-regularization and a maximum mean-discrepancy regularization terms. Firstly, we introduce a novel feature space that is extracted by combining the four feature spaces like CIELab, RGB, HOG and LBP. Secondly, we combine a boundary metric, a candidate objectness metric and a candidate distance metric to compute the low-level saliency map. Thirdly, we extract salient and non-salient dictionaries from the low-level saliency. Finally, we regularize the low-rank representation through the Laplacian regularization term that saves the structural and geometrical features and using the mean discrepancy term that reduces the distribution divergence and connections among similar regions. The proposed model is tested against seven latest salient region detection methods using the precision-recall curve, receiver operating characteristics curve, F-measure and mean absolute error. The proposed model remains persistent in all the tests and outperformed against the selected models with higher precision value.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Databases, Factual
  • Dictionaries as Topic
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Image Processing, Computer-Assisted / statistics & numerical data
  • Machine Learning
  • Neural Networks, Computer
  • Photography
  • Visual Perception

Associated data

  • figshare/10.6084/m9.figshare.7785476

Grants and funding

This work was supported in part by the National Natural Science Foundation of China (Grant no. 60972124), in part by the National High-tech Research and Development Program of China (Grant no. 2009AA01Z321) and in part of the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant no. 20110201110012). There was no additional external funding received for this study.