Deep Learning Framework for Controlling Work Sequence in Collaborative Human-Robot Assembly Processes

Pedro P Garcia; Telmo G Santos; Miguel A Machado; Nuno Mendes

doi:10.3390/s23010553

Sensors (Basel). 2023 Jan 3;23(1):553. doi: 10.3390/s23010553.

Authors

Pedro P Garcia¹, Telmo G Santos¹,², Miguel A Machado¹,², Nuno Mendes¹,²

Affiliations

¹ UNIDEMI, Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal.
² Laboratório Associado de Sistemas Inteligentes, LASI, 4800-058 Guimarães, Portugal.

Abstract

The human-robot collaboration (HRC) solutions presented so far have the disadvantage that the interaction between humans and robots is based on the human's state or on specific gestures purposely performed by the human. This increases the time required to perform a task and slows the pace of human labor, making such solutions unattractive. In this study, a different concept of the HRC system is introduced, consisting of an HRC framework for managing assembly processes that are executed simultaneously or individually by humans and robots. This HRC framework, based on deep learning models, uses only one type of data, RGB camera data, to make predictions about the collaborative workspace and human action, and consequently to manage the assembly process. To validate the HRC framework, an industrial HRC demonstrator was built to assemble a mechanical component. Four different HRC frameworks were created based on the convolutional neural network (CNN) model structures Faster R-CNN ResNet-50, Faster R-CNN ResNet-101, YOLOv2, and YOLOv3. The HRC framework with the YOLOv3 structure performed best, achieving a mean average performance of 72.26% and allowing the HRC industrial demonstrator to successfully complete all assembly tasks within a desired time window. The HRC framework has proven effective for industrial assembly applications.

Keywords: deep learning; human–robot collaborative assembly; online class detection; visual assembly task recognition.

MeSH terms

Deep Learning*
Gestures
Humans
Neural Networks, Computer
Robotics*

Grants and funding

UIDB/00667/2020/Fundação para a Ciência e Tecnologia