Controlling the Solo12 quadruped robot with deep reinforcement learning

Michel Aractingi; Pierre-Alexandre Léziart; Thomas Flayols; Julien Perez; Tomi Silander; Philippe Souères

doi:10.1038/s41598-023-38259-7

Sci Rep. 2023 Jul 24;13(1):11945. doi: 10.1038/s41598-023-38259-7.

Authors

Michel Aractingi¹ ², Pierre-Alexandre Léziart³, Thomas Flayols³, Julien Perez⁴, Tomi Silander⁴, Philippe Souères³

Affiliations

¹ LAAS-CNRS, Université de Toulouse, 31400, Toulouse, France. Michel.aractingi@gmail.com.
² NAVER LABS Europe, 38240, Meylan, France. Michel.aractingi@gmail.com.
³ LAAS-CNRS, Université de Toulouse, 31400, Toulouse, France.
⁴ NAVER LABS Europe, 38240, Meylan, France.

Abstract

Quadruped robots require robust and general locomotion skills to exploit their mobility potential in complex and challenging environments. In this work, we present an implementation of a robust end-to-end learning-based controller on the Solo12 quadruped. Our method is based on deep reinforcement learning of joint impedance references. The resulting control policies follow a commanded velocity reference while being energy-efficient and easy to deploy. We detail the learning procedure and the method for transfer to the real robot. We report extensive experiments. Finally, we present experimental results of the learned locomotion on various terrains, indoors and outdoors. These results show that the Solo12 robot is a suitable open-source platform for research combining learning and control because of the ease of transferring and deploying learned controllers.
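
To illustrate what a joint impedance reference means in practice, the following is a minimal sketch, assuming a standard PD-style impedance law in which a low-level loop turns reference joint angles into motor torques. The abstract does not specify the law, so the function name, the gains kp and kd, the torque limit tau_max, and the zero velocity reference are all hypothetical values chosen for illustration, not the authors' settings. In a learned controller of this kind, q_ref would come from the policy; here it is supplied by hand.

```python
from typing import List, Sequence


def impedance_torques(
    q_ref: Sequence[float],
    q: Sequence[float],
    dq: Sequence[float],
    kp: float = 3.0,       # hypothetical proportional gain (Nm/rad)
    kd: float = 0.2,       # hypothetical damping gain (Nm.s/rad)
    tau_max: float = 2.5,  # hypothetical torque saturation (Nm)
) -> List[float]:
    """PD-style joint impedance law, assuming a zero velocity reference.

    tau_i = kp * (q_ref_i - q_i) - kd * dq_i, clipped to [-tau_max, tau_max].
    """
    torques = []
    for q_ref_i, q_i, dq_i in zip(q_ref, q, dq):
        tau = kp * (q_ref_i - q_i) - kd * dq_i
        # Saturate so the commanded torque stays within the assumed motor limit.
        tau = max(-tau_max, min(tau_max, tau))
        torques.append(tau)
    return torques


if __name__ == "__main__":
    # Three joints of one leg: measured angles, velocities, and reference angles.
    q_measured = [0.0, 0.8, -1.6]
    dq_measured = [0.0, 0.1, -0.1]
    q_reference = [0.1, 0.7, -1.5]
    print(impedance_torques(q_reference, q_measured, dq_measured))
```

Under this assumed law, the high-level policy only has to output target joint angles, while stiffness and damping are fixed by the low-level gains; this is one common way to keep the learned part of the controller simple.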