Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning

Huan Wang; Jintao Wang

doi:10.1038/s41598-024-54938-5

Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning

Sci Rep. 2024 Feb 23;14(1):4458. doi: 10.1038/s41598-024-54938-5.

Authors

Huan Wang^{1

2}, Jintao Wang³

Affiliations

¹ College of Artificial Intelligence and Automation, Hohai University, Changzhou, 213200, China. whuan@hhu.edu.cn.
² College of information and Network Engineering, Anhui Science and Technology University, Chuzhou, 233030, China. whuan@hhu.edu.cn.
³ School of Electrical and Information Engineering, Wanjiang University of Technology, Maanshan, 243000, China.

Abstract

In the realm of air combat, autonomous decision-making in regard to Unmanned Aerial Vehicle (UAV) has emerged as a critical force. However, prevailing autonomous decision-making algorithms in this domain predominantly rely on rule-based methods, proving challenging to design and implement optimal solutions in complex multi-UAV combat environments. This paper proposes a novel approach to multi-UAV air combat decision-making utilizing hierarchical reinforcement learning. First, a hierarchical decision-making network is designed based on tactical action types to streamline the complexity of the maneuver decision-making space. Second, the high-quality combat experience gained from training is decomposed, with the aim of augmenting the quantity of valuable experiences and alleviating the intricacies of strategy learning. Finally, the performance of the algorithm is validated using the advanced UAV simulation platform JSBSim. Through comparisons with various baseline algorithms, our experiments demonstrate the superior performance of the proposed method in both even and disadvantaged air combat environments.

Abstract

Grants and funding