Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms

Entropy (Basel). 2022 Oct 10;24(10):1441. doi: 10.3390/e24101441.

Abstract

With the development of artificial intelligence, intelligent decision making for communication jamming has become an important research direction in cognitive electronic warfare. In this paper, we consider a complex intelligent jamming decision scenario in which both communicating parties adjust physical layer parameters to avoid jamming in a non-cooperative setting, while the jammer achieves accurate jamming by interacting with the environment. However, when the scenario becomes complex and the decision space grows large, traditional reinforcement learning suffers from failure to converge and a high number of interactions, both of which are fatal and unrealistic in a real warfare environment. To solve this problem, we propose a deep reinforcement learning algorithm based on the maximum-entropy soft actor-critic (SAC) method. In the proposed algorithm, we add an improved Wolpertinger architecture to the original SAC algorithm to reduce the number of interactions and improve the accuracy of the algorithm. The results show that the proposed algorithm performs well in various jamming scenarios and achieves accurate, fast, and continuous jamming of both sides of the communication.
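
As a rough illustration of the general Wolpertinger idea named above, the sketch below maps a continuous "proto-action" from an actor to one of many discrete actions: it finds the k nearest discrete actions and keeps the one a critic scores highest. This is the generic scheme, not the improved variant proposed in the paper, whose details are not given in this abstract. The function name `wolpertinger_select`, the embedding matrix, and the critic callable `q_fn` are all hypothetical names introduced for illustration.

```python
import numpy as np


def wolpertinger_select(proto_action, action_embeddings, q_fn, k=3):
    """Choose a discrete action near a continuous proto-action.

    proto_action:      (d,) vector, assumed to come from the actor.
    action_embeddings: (n, d) array, one row per discrete action (assumed).
    q_fn:              hypothetical critic; maps (m, d) candidates to (m,) values.
    Returns the row index of the selected discrete action.
    """
    # Euclidean distance from the proto-action to every discrete action.
    dists = np.linalg.norm(action_embeddings - proto_action, axis=1)

    # Never ask for more neighbours than there are actions.
    k = min(k, len(action_embeddings))

    # Indices of the k nearest actions (order within the k is not guaranteed).
    nearest = np.argpartition(dists, k - 1)[:k]

    # Score only those k candidates with the critic and keep the best one.
    q_values = q_fn(action_embeddings[nearest])
    return int(nearest[np.argmax(q_values)])
```

With k = 1 this reduces to a plain nearest-neighbour lookup; larger k lets the critic override the actor's proto-action at the cost of more critic evaluations per decision.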

Keywords: Wolpertinger architecture; cognitive radio; deep reinforcement learning; intelligent jamming; soft actor-critic.

Grants and funding

This work was supported in part by the National Natural Science Foundation of China under Grant U19B2016.