From pairwise comparisons and rating to a unified quality scale

Maria Perez-Ortiz; Aliaksei Mikhailiuk; Emin Zerman; Vedad Hulusic; Giuseppe Valenzise; Rafal K Mantiuk

doi:10.1109/TIP.2019.2936103

IEEE Trans Image Process. 2019 Aug 28. doi: 10.1109/TIP.2019.2936103. Online ahead of print.

PMID: 31478849

Abstract

The goal of psychometric scaling is to quantify perceptual experiences and to understand the relationship between an external stimulus, its internal representation, and the response. In this paper, we propose a probabilistic framework to fuse the outcomes of different psychophysical experimental protocols, namely rating and pairwise comparison experiments. Such a method can be used for merging existing subjective datasets and for experiments in which both types of measurement are collected. We analyze and compare the outcomes of both protocols in terms of time and accuracy in a set of simulations and experiments with benchmark and real-world image quality assessment datasets, showing the necessity of scaling and the advantages of each protocol and of mixing them. Although most of our examples focus on image quality assessment, our findings generalize to any other subjective quality-of-experience task.
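
To make "scaling" concrete, the sketch below fits a textbook Thurstone Case V model to a small matrix of pairwise comparison counts by maximum likelihood. It is an illustration only, not the framework proposed in the paper: it covers pairwise comparisons alone and does not fuse them with ratings. The function name `scale_pairwise`, the example counts, the unit-variance noise assumption, and the plain gradient-ascent optimizer are all assumptions made for this example.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def normal_pdf(x):
    """Standard normal probability density function."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def scale_pairwise(counts, steps=5000, lr=0.005):
    """Estimate quality scores from pairwise comparison counts.

    counts[i][j] is the number of times condition i was preferred over j.
    Assumed model (Thurstone Case V, hypothetical setup for this sketch):
    P(i preferred over j) = Phi(q_i - q_j), with unit-variance Gaussian
    noise on the quality difference. The first condition is anchored at 0
    because only score differences are identifiable.
    """
    n = len(counts)
    q = [0.0] * n
    for _ in range(steps):
        grad = [0.0] * n
        for i in range(n):
            for j in range(n):
                if i == j or counts[i][j] == 0:
                    continue
                d = q[i] - q[j]
                p = max(normal_cdf(d), 1e-12)  # guard against log(0)
                # Derivative of counts[i][j] * log(Phi(q_i - q_j)).
                g = counts[i][j] * normal_pdf(d) / p
                grad[i] += g
                grad[j] -= g
        # Gradient ascent on the log-likelihood, then re-anchor at q[0] = 0.
        q = [qi + lr * gi for qi, gi in zip(q, grad)]
        q = [qi - q[0] for qi in q]
    return q


# Made-up example: three conditions, 20 comparisons per pair.
counts = [
    [0, 14, 18],
    [6, 0, 15],
    [2, 5, 0],
]
scores = scale_pairwise(counts)
print(scores)
```

The returned scores lie on an interval scale: only differences between them are meaningful, and a larger difference corresponds to a higher probability that one condition is preferred over the other. The fit assumes every condition is connected to the others through comparisons and that no pair has a unanimous outcome in isolation, since otherwise the maximum-likelihood scores are unbounded.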