Similar to the role of Markov decision processes in reinforcement learning, Markov games (also called stochastic games) lay the foundation for the study of multi-agent reinforcement learning and sequential agent interactions. We introduce approximate Markov perfect equilibrium as a solution concept for the computational problem of finite-state stochastic games played over an infinite horizon, and we prove that computing it is PPAD-complete. This solution concept preserves the Markov perfect property and opens up the possibility of extending the success of multi-agent reinforcement learning algorithms on static two-player games to multi-agent dynamic games, expanding the realm of the PPAD-complete class.
Keywords: Markov game; Markov perfect equilibrium; PPAD-completeness; multi-agent reinforcement learning; stochastic game.
© The Author(s) 2022. Published by Oxford University Press on behalf of China Science Publishing & Media Ltd.