Optimal Greedy Control in Reinforcement Learning

Alexander Gorobtsov; Oleg Sychev; Yulia Orlova; Evgeniy Smirnov; Olga Grigoreva; Alexander Bochkin; Marina Andreeva

doi:10.3390/s22228920

Optimal Greedy Control in Reinforcement Learning

Sensors (Basel). 2022 Nov 18;22(22):8920. doi: 10.3390/s22228920.

Authors

Alexander Gorobtsov^{1

2}, Oleg Sychev³, Yulia Orlova³, Evgeniy Smirnov¹, Olga Grigoreva¹, Alexander Bochkin¹, Marina Andreeva¹

Affiliations

¹ Higher Mathematics Department, Volgograd State Technical University, Lenin Ave, 28, Volgograd 400005, Russia.
² Mechanical Engineering Research Institute, Russian Academy of Sciences, Maly Kharitonyevsky Pereulok, 4, Moscow 101990, Russia.
³ Software Engineering Department, Volgograd State Technical University, Lenin Ave, 28, Volgograd 400005, Russia.

Abstract

We consider the problem of dimensionality reduction of state space in the variational approach to the optimal control problem, in particular, in the reinforcement learning method. The control problem is described by differential algebraic equations consisting of nonlinear differential equations and algebraic constraint equations interconnected with Lagrange multipliers. The proposed method is based on changing the Lagrange multipliers of one subset based on the Lagrange multipliers of another subset. We present examples of the application of the proposed method in robotics and vibration isolation in transport vehicles. The method is implemented in FRUND-a multibody system dynamics software package.

Keywords: machine learning; optimal control; reinforcement learning; robotics; variational methods.

Grants and funding

This research received no external funding.