RGB-D Data-Based Action Recognition: A Review

Muhammad Bilal Shaikh; Douglas Chai

doi:10.3390/s21124246

RGB-D Data-Based Action Recognition: A Review

Sensors (Basel). 2021 Jun 21;21(12):4246. doi: 10.3390/s21124246.

Authors

Muhammad Bilal Shaikh¹, Douglas Chai¹

Affiliation

¹ School of Engineering, Edith Cowan University, Perth, WA 6027, Australia.

Abstract

Classification of human actions is an ongoing research problem in computer vision. This review is aimed to scope current literature on data fusion and action recognition techniques and to identify gaps and future research direction. Success in producing cost-effective and portable vision-based sensors has dramatically increased the number and size of datasets. The increase in the number of action recognition datasets intersects with advances in deep learning architectures and computational support, both of which offer significant research opportunities. Naturally, each action-data modality-such as RGB, depth, skeleton, and infrared (IR)-has distinct characteristics; therefore, it is important to exploit the value of each modality for better action recognition. In this paper, we focus solely on data fusion and recognition techniques in the context of vision with an RGB-D perspective. We conclude by discussing research challenges, emerging trends, and possible future research directions.

Keywords: RGB-D; action recognition; data fusion; deep learning.

Publication types

Review

MeSH terms

Algorithms*
Databases, Factual
Human Activities*
Humans
Skeleton
Vision, Ocular

Grants and funding

No.5-1/HRD/UESTPI(Batch-VI)/7108/2018/HEC/Higher Education Commission, Pakistan