Manifold Regularized Reinforcement Learning

IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):932-943. doi: 10.1109/TNNLS.2017.2650943. Epub 2017 Jan 27.

Abstract

This paper introduces a novel manifold regularized reinforcement learning scheme for continuous Markov decision processes. Smooth feature representations for value function approximation can be automatically learned using the unsupervised manifold regularization method. The learned features are data-driven, and can be adapted to the geometry of the state space. Furthermore, the scheme provides a direct basis representation extension for novel samples during policy learning and control. The performance of the proposed scheme is evaluated on two benchmark control tasks, i.e., the inverted pendulum and the energy storage problem. Simulation results illustrate the concepts of the proposed scheme and show that it can obtain excellent performance.
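To make the idea of data-driven features with a direct extension to novel samples concrete, the following is a minimal sketch of one generic, textbook-style formulation of unsupervised manifold regularization. It is not the algorithm from the paper, whose details the abstract does not give. The sketch assumes a Gaussian kernel, a dense Gaussian-weighted graph Laplacian, and features of the form f(x) = sum_i alpha_i k(x_i, x), chosen to minimize f'Lf + gamma * ||f||_K^2 subject to orthonormality on the sampled states. All names (rbf_kernel, learn_features, features) and parameter values (sigma, gamma, tol) are hypothetical and chosen only for illustration.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    # Gaussian kernel matrix between the rows of A and the rows of B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def learn_features(X, n_features=5, sigma=0.5, gamma=1e-3, tol=1e-8):
    # Assumed setup: the same Gaussian kernel defines the RKHS and the graph weights.
    K = rbf_kernel(X, X, sigma)
    W = K.copy()
    np.fill_diagonal(W, 0.0)
    L = np.diag(W.sum(axis=1)) - W          # unnormalized graph Laplacian

    # Work in the numerically non-null eigenspace of K for stability.
    mu, U = np.linalg.eigh(K)
    keep = mu > tol * mu.max()
    mu, U = mu[keep], U[:, keep]

    # With f = U c on the samples, the objective is c'(U'LU + gamma * diag(1/mu))c
    # under c'c = 1, so the smoothest features are the bottom eigenvectors of M.
    M = U.T @ L @ U + gamma * np.diag(1.0 / mu)
    _, C = np.linalg.eigh(M)                # eigenvalues in ascending order
    C = C[:, :n_features]

    # Kernel expansion coefficients: alpha = U diag(1/mu) c.
    return U @ (C / mu[:, None])

def features(X_new, X, alpha, sigma=0.5):
    # Direct extension to arbitrary states through the kernel expansion.
    return rbf_kernel(X_new, X, sigma) @ alpha

rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(60, 2))    # sampled states (synthetic, for illustration)
alpha = learn_features(X)
Phi = features(X, X, alpha)                 # features on the sampled states
phi_new = features(np.array([[0.1, -0.2]]), X, alpha)  # features for an unseen state
```

Because each feature is a kernel expansion over the sampled states, evaluating it at an unseen state needs only one row of kernel values and no re-solving of the eigenproblem. A linear value function approximator could then be fitted on Phi, for example by least squares.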

Publication types

  • Research Support, Non-U.S. Gov't