On the Evaluation of Diverse Vision Systems towards Detecting Human Pose in Collaborative Robot Applications

Sensors (Basel). 2024 Jan 17;24(2):578. doi: 10.3390/s24020578.

Abstract

Tracking human operators working in the vicinity of collaborative robots can improve the design of safety architecture, ergonomics, and the execution of assembly tasks in a human-robot collaboration scenario. Three commercial spatial computation kits were used along with their Software Development Kits (SDKs), which provide various real-time functionalities for tracking human poses. The paper explored the possibility of combining the capabilities of different hardware systems and software frameworks, which may lead to better performance and accuracy in detecting the human pose in collaborative robotic applications. This study assessed their performance in two different human poses at six depth levels, comparing raw data with noise-filtered data. In addition, a laser measurement device was employed as a ground truth indicator, together with the average Root Mean Square Error (RMSE) as the error metric. The obtained results were analysed and compared in terms of positional accuracy and repeatability, indicating the dependence of the sensors' performance on the tracking distance. A Kalman-based filter was applied to fuse the human skeleton data and then to reconstruct the operator's poses, taking into account each sensor's performance in the different distance zones. The results indicated that at distances of less than 3 m, the Microsoft Azure Kinect demonstrated the best tracking performance, followed by the Intel RealSense D455 and the Stereolabs ZED2, while at ranges greater than 3 m, the ZED2 had superior tracking performance.
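The fusion step described in the abstract, Kalman filtering of per-joint position data with distance-dependent weighting of each sensor, could be sketched roughly as follows. This is an illustrative minimal sketch, not the authors' implementation: the class, the scalar 1D state model, and the noise parameters are all assumptions made for clarity.

```python
import numpy as np

def avg_rmse(estimates, ground_truth):
    """Average Root Mean Square Error between estimated and true joint positions."""
    est, gt = np.asarray(estimates, float), np.asarray(ground_truth, float)
    return float(np.sqrt(np.mean((est - gt) ** 2)))

class Kalman1D:
    """Minimal 1D Kalman filter fusing noisy readings of one joint coordinate.

    Measurements from different sensors are fused sequentially, each with its
    own variance r, so a sensor known to perform better in the current distance
    zone (e.g. Azure Kinect under 3 m, ZED2 beyond) receives more weight.
    """
    def __init__(self, process_noise=1e-3):
        self.x = None              # state estimate (joint coordinate)
        self.p = 1.0               # estimate variance
        self.q = process_noise     # process noise (assumed constant)

    def update(self, z, r):
        """Fuse measurement z whose variance r reflects sensor accuracy."""
        if self.x is None:         # initialise from the first reading
            self.x, self.p = z, r
            return self.x
        self.p += self.q                   # predict: variance grows
        k = self.p / (self.p + r)          # Kalman gain
        self.x += k * (z - self.x)         # correct toward the measurement
        self.p *= (1.0 - k)                # shrink variance after the update
        return self.x
```

In use, one such filter would run per joint coordinate, with `r` chosen per sensor from its empirically measured error in the relevant distance zone; the fused trajectory is then scored against the laser ground truth with `avg_rmse`.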

Keywords: collaborative robotics; data-fusion; human-tracking; markerless tracking; vision sensors.

MeSH terms

  • Ergonomics
  • Humans
  • Optical Devices*
  • Robotics*
  • Skeleton
  • Software
