Multi-Lane Differential Variable Speed Limit Control via Deep Neural Networks Optimized by an Adaptive Evolutionary Strategy

Jianshuai Feng; Tianyu Shi; Yuankai Wu; Xiang Xie; Hongwen He; Huachun Tan

doi:10.3390/s23104659

Multi-Lane Differential Variable Speed Limit Control via Deep Neural Networks Optimized by an Adaptive Evolutionary Strategy

Sensors (Basel). 2023 May 11;23(10):4659. doi: 10.3390/s23104659.

Authors

Jianshuai Feng¹, Tianyu Shi², Yuankai Wu³, Xiang Xie⁴, Hongwen He¹, Huachun Tan⁵

Affiliations

¹ School of Mechanical Engineering, Beijing Institute of Technology, Beijing 100081, China.
² Intelligent Transportation Systems Centre, University of Toronto, Toronto, ON M5S 1A4, Canada.
³ National Key Laboratory of Fundamental Science on Synthetic Vision, Sichuan University, Chengdu 610065, China.
⁴ School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China.
⁵ Advanced Research Institute of Multidisciplinary Sciences, Beijing Institute of Technology, Beijing 100081, China.

Abstract

In advanced transportation-management systems, variable speed limits are a crucial application. Deep reinforcement learning methods have been shown to have superior performance in many applications, as they are an effective approach to learning environment dynamics for decision-making and control. However, they face two significant difficulties in traffic-control applications: reward engineering with delayed reward and brittle convergence properties with gradient descent. To address these challenges, evolutionary strategies are well suited as a class of black-box optimization techniques inspired by natural evolution. Additionally, the traditional deep reinforcement learning framework struggles to handle the delayed reward setting. This paper proposes a novel approach using covariance matrix adaptation evolution strategy (CMA-ES), a gradient-free global optimization method, to handle the task of multi-lane differential variable speed limit control. The proposed method uses a deep-learning-based method to dynamically learn optimal and distinct speed limits among lanes. The parameters of the neural network are sampled using a multivariate normal distribution, and the dependencies between the variables are represented by a covariance matrix that is optimized dynamically by CMA-ES based on the freeway's throughput. The proposed approach is tested on a freeway with simulated recurrent bottlenecks, and the experimental results show that it outperforms deep reinforcement learning-based approaches, traditional evolutionary search methods, and the no-control scenario. Our proposed method demonstrates a 23% improvement in average travel time and an average of a 4% improvement in CO, HC, and NOx emission.Furthermore, the proposed method produces explainable speed limits and has desirable generalization power.

Keywords: connected and autonomous vehicles; deep neural networks; evolutionary strategies; variable speed limits.

Grants and funding

2019YFB1600100/National Key Research and Development Program of China