A target-driven visual navigation method based on intrinsic motivation exploration and space topological cognition

Sci Rep. 2022 Mar 2;12(1):3462. doi: 10.1038/s41598-022-07264-7.

Abstract

Target-driven visual navigation is essential for many applications in robotics, and it has gained increasing interest in recent years. In this work, inspired by animal cognitive mechanisms, we propose a novel navigation architecture that simultaneously learns an exploration policy and encodes the structure of the environment. First, to learn the exploration policy directly from raw visual input, we use deep reinforcement learning as the basic framework and allow agents to generate their own rewards as learning signals. In our approach, the reward for the current observation is driven by curiosity and computed from a count-based bonus and temporal distance. While agents learn the exploration policy, we use temporal distance to identify waypoints in observation sequences and incrementally describe the structure of the environment in a way that integrates episodic memory. Finally, space topological cognition is integrated into the model as a path-planning module and combined with a locomotion network to obtain a more generalized approach to navigation. We test our approach in DMLab, a visually rich 3D environment, and validate its exploration efficiency and navigation performance through extensive experiments. The experimental results show that our approach explores and encodes the environment more efficiently and copes better with stochastic objects. In navigation tasks, agents can use space topological cognition to reach the target effectively and to guide detour behaviour when a path is unavailable, exhibiting good environmental adaptability.
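
The mechanisms summarized above (a count-based curiosity bonus, waypoint selection by temporal distance, and path planning over the resulting topological graph) can be sketched in miniature. The code below is not the authors' implementation; it is a minimal illustration under simplifying assumptions: observations are hashable keys, the temporal-distance predictor is passed in as an arbitrary function `temporal_dist`, and planning uses plain breadth-first search over the waypoint graph. The class name, thresholds, and the `beta` scale are all hypothetical.

```python
from collections import defaultdict, deque


class CuriosityExplorer:
    """Illustrative sketch: count-based intrinsic reward plus a waypoint graph."""

    def __init__(self, beta=1.0, dist_threshold=2.0):
        self.beta = beta                      # hypothetical reward scale
        self.dist_threshold = dist_threshold  # temporal distance that spawns a new waypoint
        self.counts = defaultdict(int)        # visit counts per (hashed) observation
        self.waypoints = []                   # stored observations acting as graph nodes
        self.edges = defaultdict(set)         # adjacency list of the topological graph

    def intrinsic_reward(self, obs_key):
        """Count-based novelty bonus: decays as an observation is revisited."""
        self.counts[obs_key] += 1
        return self.beta / (self.counts[obs_key] ** 0.5)

    def maybe_add_waypoint(self, obs, temporal_dist):
        """Add obs as a new node when it is temporally far from every stored
        waypoint; otherwise treat it as a revisit of its nearest waypoint."""
        if not self.waypoints:
            self.waypoints.append(obs)
            return 0
        dists = [temporal_dist(obs, w) for w in self.waypoints]
        nearest = min(range(len(dists)), key=dists.__getitem__)
        if dists[nearest] > self.dist_threshold:
            self.waypoints.append(obs)
            new_idx = len(self.waypoints) - 1
            # Link the new waypoint to the temporally closest existing one.
            self.edges[nearest].add(new_idx)
            self.edges[new_idx].add(nearest)
            return new_idx
        return nearest

    def plan(self, start, goal):
        """BFS over the waypoint graph; returns a node path, or None if
        the goal is unreachable (the case that triggers detour behaviour)."""
        frontier, parents = deque([start]), {start: None}
        while frontier:
            node = frontier.popleft()
            if node == goal:
                path = []
                while node is not None:
                    path.append(node)
                    node = parents[node]
                return path[::-1]
            for nxt in self.edges[node]:
                if nxt not in parents:
                    parents[nxt] = node
                    frontier.append(nxt)
        return None
```

As a toy usage, one can stand in for the learned temporal-distance network with `lambda a, b: abs(a - b)` on integer "observations": feeding 0, 3, and 6 with a threshold of 2 yields three waypoints chained 0–1–2, and `plan(0, 2)` recovers the path through the intermediate node.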

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cognition
  • Learning
  • Motivation*
  • Reinforcement, Psychology*
  • Reward