Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms

Yuting Xu; Chao Wang; Jiakai Liang; Keqiang Yue; Wenjun Li; Shilian Zheng; Zhijin Zhao

doi:10.3390/e24101441

Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms

Entropy (Basel). 2022 Oct 10;24(10):1441. doi: 10.3390/e24101441.

Authors

Yuting Xu¹, Chao Wang¹, Jiakai Liang¹, Keqiang Yue^{1

2}, Wenjun Li¹, Shilian Zheng², Zhijin Zhao³

Affiliations

¹ Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.
² Science and Technology on Communication Information Security Control Laboratory, The No. 011 Research Center, Jiaxing 314033, China.
³ The School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China.

Abstract

With the development of artificial intelligence, intelligent communication jamming decision making is an important research direction of cognitive electronic warfare. In this paper, we consider a complex intelligent jamming decision scenario in which both communication parties choose to adjust physical layer parameters to avoid jamming in a non-cooperative scenario and the jammer achieves accurate jamming by interacting with the environment. However, when the situation becomes complex and large in number, traditional reinforcement learning suffers from the problems of failure to converge and a high number of interactions, which are fatal and unrealistic in a real warfare environment. To solve this problem, we propose a deep reinforcement learning based and maximum-entropy-based soft actor-critic (SAC) algorithm. In the proposed algorithm, we add an improved Wolpertinger architecture to the original SAC algorithm in order to reduce the number of interactions and improve the accuracy of the algorithm. The results show that the proposed algorithm shows excellent performance in various scenarios of jamming and achieves accurate, fast, and continuous jamming for both sides of the communication.

Keywords: Wolpertinger architecture; cognitive radio; deep reinforcement learning; intelligent jamming; soft actor-critic.

Grants and funding

This work was supported in part by the National Natural Science Foundation of China under Grants U19B2016.