Reinforcement Learning-Based Joint User Pairing and Power Allocation in MIMO-NOMA Systems

Sensors (Basel). 2020 Dec 11;20(24):7094. doi: 10.3390/s20247094.

Abstract

In this paper, we consider a multiple-input multiple-output (MIMO)-non-orthogonal multiple access (NOMA) system with reinforcement learning (RL). NOMA, which is a technique for increasing the spectrum efficiency, has been extensively studied in fifth-generation (5G) wireless communication systems. The application of MIMO to NOMA can result in an even higher spectral efficiency. Moreover, user pairing and power allocation problem are important techniques in NOMA. However, NOMA has a fundamental limitation of the high computational complexity due to rapidly changing radio channels. This limitation makes it difficult to utilize the characteristics of the channel and allocate radio resources efficiently. To reduce the computational complexity, we propose an RL-based joint user pairing and power allocation scheme. By applying Q-learning, we are able to perform user pairing and power allocation simultaneously, which reduces the computational complexity. The simulation results show that the proposed scheme achieves a sum rate similar to that achieved with the exhaustive search (ES).

Keywords: multiple-input multiple-output; non-orthogonal multiple access; power allocation; reinforcement learning; user pairing.