UAV Trajectory Optimization in a Post-Disaster Area Using Dual Energy-Aware Bandits

Sensors (Basel). 2023 Jan 26;23(3):1402. doi: 10.3390/s23031402.

Abstract

Over the past few years, with the rapid increase in the number of natural disasters, the need to provide smart emergency wireless communication services has become crucial. Unmanned aerial Vehicles (UAVs) have gained much attention as promising candidates due to their unprecedented capabilities and broad flexibility. In this paper, we investigate a UAV-based emergency wireless communication network for a post-disaster area. Our optimization problem aims to optimize the UAV's flight trajectory to maximize the number of visited ground users during the flight period. Then, a dual cost-aware multi-armed bandit algorithm is adopted to tackle this problem under the limited available energy for both the UAV and ground users. Simulation results show that the proposed algorithm could solve the optimization problem and maximize the achievable throughput under these energy constraints.

Keywords: cost subsidy; multi-armed bandit; post-disaster; reinforcement learning; trajectory optimization; unmanned aerial vehicle.

Grants and funding

This research received no external funding.