A New Cache Update Scheme Using Reinforcement Learning for Coded Video Streaming Systems

Yu-Sin Kim; Jeong-Min Lee; Jong-Yeol Ryu; Tae-Won Ban

doi:10.3390/s21082867

Sensors (Basel). 2021 Apr 19;21(8):2867. doi: 10.3390/s21082867.

Authors

Yu-Sin Kim¹, Jeong-Min Lee², Jong-Yeol Ryu², Tae-Won Ban²

Affiliations

¹ Algorithm Team, Carvi, Seoul 08513, Korea.
² Department of Information and Communication Engineering, Gyeongsang National University, Gyeongnam 53064, Korea.

Abstract

As the demand for video streaming grows rapidly, technologies that improve streaming efficiency have attracted much attention. In this paper, we investigate how clients' cache storage can improve the efficiency of exclusive OR (XOR) coding-based video streaming, in which multiple different video contents can be sent simultaneously in a single transmission as long as prerequisite conditions are satisfied, significantly enhancing efficiency. We also propose a new cache update scheme using reinforcement learning. The proposed scheme uses a K-actor-critic (K-AC) network that mitigates the disadvantage of actor-critic networks by yielding K candidate outputs and selecting the candidate with the highest value as the final output. A K-AC network resides in each client, and each client trains it using only locally available information, without any feedback or signaling, so the proposed cache update scheme is completely decentralized. The performance of the proposed scheme was analyzed in terms of the average number of transmissions for XOR coding-based video streaming and compared with that of conventional cache update schemes. Our numerical results show that the proposed scheme can reduce the number of transmissions by up to 24% when the number of videos is 100, the number of clients is 50, and the cache size is 5.
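
The XOR coding idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the "prerequisite conditions" mean each client already caches every other video combined into the coded transmission, that all requests are distinct, and that video chunks have equal length. The names xor_bytes, can_code_together, decode, and the two-client setup are hypothetical and chosen only for illustration.

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length chunks."""
    if len(a) != len(b):
        raise ValueError("chunks must have equal length")
    return bytes(x ^ y for x, y in zip(a, b))


def can_code_together(requests: dict, caches: dict) -> bool:
    """Assumed prerequisite: every client caches all videos requested by
    the other clients in the group (requests are assumed distinct)."""
    for client in requests:
        others = {v for c, v in requests.items() if c != client}
        if not others <= caches[client]:
            return False
    return True


def decode(coded: bytes, cached_chunks: list) -> bytes:
    """Recover the wanted chunk by XOR-ing out the cached chunks."""
    out = coded
    for chunk in cached_chunks:
        out = xor_bytes(out, chunk)
    return out


# Hypothetical setup: two videos, two clients with complementary caches.
videos = {1: b"AAAA", 2: b"BBBB"}
requests = {"client_a": 1, "client_b": 2}
caches = {"client_a": {2}, "client_b": {1}}

if can_code_together(requests, caches):
    # One coded transmission serves both clients instead of two.
    coded = xor_bytes(videos[1], videos[2])
    recovered_a = decode(coded, [videos[2]])  # client_a holds video 2
    recovered_b = decode(coded, [videos[1]])  # client_b holds video 1
```

In this toy case a single transmission replaces two, which is why the contents of each client's cache, and hence the cache update scheme, determine how often such coded transmissions are possible.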

Keywords: cache; exclusive OR; multimedia; reinforcement learning; streaming.

Grants and funding

No. 2020R1I1A3061195/National Research Foundation of Korea