Integrating Visual and Network Data with Deep Learning for Streaming Video Quality Assessment

Sensors (Basel). 2023 Apr 14;23(8):3998. doi: 10.3390/s23083998.

Abstract

Existing video Quality-of-Experience (QoE) metrics rely on the decoded video for the estimation. In this work, we explore how the overall viewer experience, quantified via the QoE score, can be automatically derived using only information available before and during the transmission of videos, on the server side. To validate the merits of the proposed scheme, we consider a dataset of videos encoded and streamed under different conditions and train a novel deep learning architecture for estimating the QoE of the decoded video. The major novelty of our work is the exploitation and demonstration of cutting-edge deep learning techniques in automatically estimating video QoE scores. Our work significantly extends the existing approach for estimating the QoE in video streaming services by combining visual information and network conditions.

Keywords: ITU-T P.1203; PatchVQ; QoE; QoE assessment; deep learning; video streaming.