Soft DAgger: Sample-Efficient Imitation Learning for Control of Soft Robots

Muhammad Sunny Nazeer; Cecilia Laschi; Egidio Falotico

doi:10.3390/s23198278

Soft DAgger: Sample-Efficient Imitation Learning for Control of Soft Robots

Sensors (Basel). 2023 Oct 6;23(19):8278. doi: 10.3390/s23198278.

Authors

Muhammad Sunny Nazeer^{1

2}, Cecilia Laschi³, Egidio Falotico^{1

2}

Affiliations

¹ The BioRobotics Institute, Scuola Superiore Sant'Anna, 56025 Pontedera, Italy.
² Department of Excellence in Robotics and AI, Scuola Superiore Sant'Anna, 56125 Pisa, Italy.
³ Department of Mechanical Engineering, National University of Singapore, Singapore 117575, Singapore.

Abstract

This paper presents Soft DAgger, an efficient imitation learning-based approach for training control solutions for soft robots. To demonstrate the effectiveness of the proposed algorithm, we implement it on a two-module soft robotic arm involved in the task of writing letters in 3D space. Soft DAgger uses a dynamic behavioral map of the soft robot, which maps the robot's task space to its actuation space. The map acts as a teacher and is responsible for predicting the optimal actions for the soft robot based on its previous state action history, expert demonstrations, and current position. This algorithm achieves generalization ability without depending on costly exploration techniques or reinforcement learning-based synthetic agents. We propose two variants of the control algorithm and demonstrate that good generalization capabilities and improved task reproducibility can be achieved, along with a consistent decrease in the optimization time and samples. Overall, Soft DAgger provides a practical control solution to perform complex tasks in fewer samples with soft robots. To the best of our knowledge, our study is an initial exploration of imitation learning with online optimization for soft robot control.

Keywords: DAgger algorithm; Soft DAgger; dynamic behavioral mapping; imitation learning; online optimization; soft robotics; soft robots control.

Abstract

Grants and funding