Differential Graphical Games for Constrained Autonomous Vehicles Based on Viability Theory

Bowen Peng; Alexandru Stancu; Shuping Dang; Zhengtao Ding

doi:10.1109/TCYB.2021.3054430

Differential Graphical Games for Constrained Autonomous Vehicles Based on Viability Theory

IEEE Trans Cybern. 2022 Sep;52(9):8897-8910. doi: 10.1109/TCYB.2021.3054430. Epub 2022 Aug 18.

Authors

Bowen Peng, Alexandru Stancu, Shuping Dang, Zhengtao Ding

PMID: 33729967
DOI: 10.1109/TCYB.2021.3054430

Abstract

This article proposes an optimal-distributed control protocol for multivehicle systems with an unknown switching communication graph. The optimal-distributed control problem is formulated to differential graphical games, and the Pareto optimum to multiplayer games is sought based on the viability theory and reinforcement learning techniques. The viability theory characterizes the controllability of a wide range of constrained nonlinear systems; and the viability kernel and the capture basin are the pillars of the viability theory. The capture basin is the set of all initial states, in which there exist control strategies that enable the states to reach the target in finite time while remaining inside a set before reaching the target. In this regard, the feasible learning region is characterized by the reinforcement learner. In addition, the approximation of the capture basin provides the learner with prior knowledge. Unlike the existing works that employ the viability theory to solve control problems with only one agent and differential games with only two players, the viability theory, in this article, is utilized to solve multiagent control problems and multiplayer differential games. The distributed control law is composed of two parts: 1) the approximation of the capture basin and 2) reinforcement learning, which are computed offline and online, respectively. The convergence properties of the parameters' estimation errors in reinforcement learning are proved, and the convergence of the control policy to the Pareto optimum of the differential graphical game is discussed. The guaranteed approximation results of the capture basin are provided and the simulation results of the differential graphical game are provided for multivehicle systems with the proposed distributed control policy.

MeSH terms

Autonomous Vehicles*
Computer Simulation
Learning
Reinforcement, Psychology*