Comparison of Graph Fitting and Sparse Deep Learning Model for Robot Pose Estimation

Sensors (Basel). 2022 Aug 29;22(17):6518. doi: 10.3390/s22176518.

Abstract

The paper presents a simple, yet robust computer vision system for robot arm tracking with the use of RGB-D cameras. Tracking means to measure in real time the robot state given by three angles and with known restrictions about the robot geometry. The tracking system consists of two parts: image preprocessing and machine learning. In the machine learning part, we compare two approaches: fitting the robot pose to the point cloud and fitting the convolutional neural network model to the sparse 3D depth images. The advantage of the presented approach is direct use of the point cloud transformed to the sparse image in the network input and use of sparse convolutional and pooling layers (sparse CNN). The experiments confirm that the robot tracking is performed in real time and with an accuracy comparable to the accuracy of the depth sensor.

Keywords: arm tracking; computer vision; depth camera; pose estimation; pose fitting; robot tracking; sparse CNN; sparse deep learning.

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Deep Learning*
  • Neural Networks, Computer
  • Robotics* / methods

Grants and funding

This research received no external funding.