Energy-Efficient Data Collection Using Autonomous Underwater Glider: A Reinforcement Learning Formulation

Sensors (Basel). 2020 Jul 4;20(13):3758. doi: 10.3390/s20133758.

Abstract

The autonomous underwater glider has attracted considerable interest for underwater activities, especially long-term, large-scale underwater data collection. In this paper, we focus on the application of gliders gathering data from underwater sensor networks over underwater acoustic channels. This application, however, suffers from a rapidly time-varying environment and limited energy. To optimize data-collection performance and maximize the network lifetime, we propose a distributed, energy-efficient sensor scheduling algorithm based on a multi-armed bandit formulation. In addition, we design an indexable threshold policy to trade off data quality against collection delay. Moreover, to reduce the computational complexity, we divide the proposed algorithm into an off-line computation part and an on-line scheduling part. Simulation results indicate that the proposed policy significantly improves data-collection performance and reduces energy consumption. They also demonstrate the effectiveness of the threshold, which reduces the collection delay by at least 10% while guaranteeing data quality.

Keywords: autonomous underwater glider; energy efficiency; indexable threshold policy; multi-armed bandit; underwater data collection.
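To illustrate the kind of index-based sensor scheduling the abstract describes, the sketch below uses a standard UCB1 bandit index combined with a simple threshold rule; this is a generic illustration under assumed names and parameters (`ucb_schedule`, `threshold`, simulated per-sensor quality), not the paper's actual algorithm or policy.

```python
import math
import random

def ucb_schedule(n_sensors, horizon, threshold, seed=0):
    """Illustrative bandit-style scheduler: at each step the glider polls
    the sensor with the highest UCB1 index (estimated data quality plus an
    exploration bonus), and stops early once the best index falls below
    `threshold`, trading residual data quality for reduced collection delay."""
    rng = random.Random(seed)
    # Hidden per-sensor mean data quality (unknown to the scheduler).
    true_quality = [rng.uniform(0.2, 0.9) for _ in range(n_sensors)]
    counts = [0] * n_sensors   # number of times each sensor was polled
    means = [0.0] * n_sensors  # running mean of observed data quality
    collected, polls = 0.0, 0
    for t in range(1, horizon + 1):
        if t <= n_sensors:
            arm = t - 1  # poll each sensor once to initialize estimates
        else:
            index = lambda i: means[i] + math.sqrt(2 * math.log(t) / counts[i])
            arm = max(range(n_sensors), key=index)
            # Threshold policy: stop collecting when even the best index
            # no longer justifies the delay of another poll.
            if index(arm) < threshold:
                break
        # Bernoulli reward: a successful, high-quality packet or not.
        reward = 1.0 if rng.random() < true_quality[arm] else 0.0
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]
        collected += reward
        polls += 1
    return collected, polls
```

A usage example: `collected, polls = ucb_schedule(5, 2000, threshold=0.4)` returns the total quality-weighted data gathered and the number of polls spent before the threshold halted collection (or the horizon was reached).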