Multiple Self-Supervised Auxiliary Tasks for Target-Driven Visual Navigation Using Deep Reinforcement Learning

Entropy (Basel). 2023 Jun 30;25(7):1007. doi: 10.3390/e25071007.

Abstract

Visual navigation based on deep reinforcement learning requires a large amount of interaction with the environment, and because rewards are sparse, it demands substantial training time and computational resources. In this paper, we focus on sample efficiency and navigation performance and propose a framework for visual navigation based on multiple self-supervised auxiliary tasks. Specifically, we present an LSTM-based dynamics model and an attention-based image-reconstruction model as auxiliary tasks. These self-supervised auxiliary tasks enable agents to learn navigation strategies directly from raw high-dimensional images, without relying on ResNet features, by shaping the learned latent representations. Experimental results show that, without manually designed features or prior demonstrations, our method significantly improves training efficiency and outperforms baseline algorithms on both the simulator and real-world image datasets.
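To give a concrete picture of how such auxiliary heads can attach to a navigation agent, the PyTorch sketch below pairs a convolutional observation encoder with an LSTM forward-dynamics head and an image-reconstruction decoder. It is only an illustration under assumed settings: the module names (ObsEncoder, DynamicsLSTM, ReconDecoder), the 84x84 RGB input size, and the plain deconvolutional decoder (the paper's attention mechanism in the reconstruction branch is omitted) are not the authors' exact architecture. The two auxiliary losses returned at the end would be added, with weights, to the usual reinforcement-learning policy loss.

    # Minimal sketch (PyTorch) of self-supervised auxiliary heads for a
    # navigation agent. Names and shapes are illustrative assumptions.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class ObsEncoder(nn.Module):
        """Maps raw 84x84 RGB observations to a compact latent state."""
        def __init__(self, latent_dim=256):
            super().__init__()
            self.conv = nn.Sequential(
                nn.Conv2d(3, 32, 8, stride=4), nn.ReLU(),
                nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
                nn.Conv2d(64, 64, 3, stride=1), nn.ReLU(),
                nn.Flatten(),
            )
            self.fc = nn.LazyLinear(latent_dim)

        def forward(self, obs):                      # obs: (B, 3, 84, 84)
            return self.fc(self.conv(obs))           # (B, latent_dim)

    class DynamicsLSTM(nn.Module):
        """Auxiliary task 1: predict the next latent state from (latent, action)."""
        def __init__(self, latent_dim=256, num_actions=4):
            super().__init__()
            self.lstm = nn.LSTM(latent_dim + num_actions, latent_dim, batch_first=True)
            self.head = nn.Linear(latent_dim, latent_dim)

        def forward(self, latents, actions_onehot):  # (B, T, D), (B, T, A)
            h, _ = self.lstm(torch.cat([latents, actions_onehot], dim=-1))
            return self.head(h)                      # predicted next latents (B, T, D)

    class ReconDecoder(nn.Module):
        """Auxiliary task 2: reconstruct the observation from the latent state."""
        def __init__(self, latent_dim=256):
            super().__init__()
            self.fc = nn.Linear(latent_dim, 64 * 7 * 7)
            self.deconv = nn.Sequential(
                nn.ConvTranspose2d(64, 64, 3, stride=1), nn.ReLU(),
                nn.ConvTranspose2d(64, 32, 4, stride=2), nn.ReLU(),
                nn.ConvTranspose2d(32, 3, 8, stride=4), nn.Sigmoid(),
            )

        def forward(self, latent):
            x = self.fc(latent).view(-1, 64, 7, 7)
            return self.deconv(x)                    # reconstructed 84x84 image

    def auxiliary_losses(encoder, dynamics, decoder, obs_seq, actions_onehot):
        """Compute both self-supervised losses for one trajectory batch.

        obs_seq:        (B, T+1, 3, 84, 84) observations
        actions_onehot: (B, T, A) actions taken between consecutive observations
        """
        B, Tp1 = obs_seq.shape[:2]
        latents = encoder(obs_seq.flatten(0, 1)).view(B, Tp1, -1)

        # Forward-dynamics loss: predict z_{t+1} from (z_t, a_t).
        pred_next = dynamics(latents[:, :-1], actions_onehot)
        dyn_loss = F.mse_loss(pred_next, latents[:, 1:].detach())

        # Reconstruction loss: decode every latent back to its observation.
        recon = decoder(latents.flatten(0, 1))
        recon_loss = F.mse_loss(recon, obs_seq.flatten(0, 1))

        # In training, these terms are weighted and added to the RL objective.
        return dyn_loss, recon_loss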

Keywords: deep reinforcement learning; representation learning; self-supervised auxiliary tasks; target-driven visual navigation.