CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning

Tian Ma; Jiahao Lyu; Jiayi Yang; Runtao Xi; Yuancheng Li; Jinpeng An; Chao Li

doi:10.3390/s22155910

CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning

Sensors (Basel). 2022 Aug 8;22(15):5910. doi: 10.3390/s22155910.

Authors

Tian Ma¹, Jiahao Lyu¹, Jiayi Yang¹, Runtao Xi¹, Yuancheng Li¹, Jinpeng An¹, Chao Li¹

Affiliation

¹ College of Computer Science and Technology, Xi'an University of Science and Technology, Xi'an 710054, China.

Abstract

How to generate the path planning of mobile robots quickly is a problem in the field of robotics. The Q-learning(QL) algorithm has recently become increasingly used in the field of mobile robot path planning. However, its selection policy is blind in most cases in the early search process, which slows down the convergence of optimal solutions, especially in a complex environment. Therefore, in this paper, we propose a continuous local search Q-Learning (CLSQL) algorithm to solve these problems and ensure the quality of the planned path. First, the global environment is gradually divided into independent local environments. Then, the intermediate points are searched in each local environment with prior knowledge. After that, the search between each intermediate point is realized to reach the destination point. At last, by comparing other RL-based algorithms, the proposed method improves the convergence speed and computation time while ensuring the optimal path.

Keywords: Q-learning; complex environment; mobile robot; path planning; prior knowledge.

MeSH terms

Algorithms
Computer Simulation
Policy
Robotics* / methods

Abstract

MeSH terms

Grants and funding