A Bayesian Dynamical Approach for Human Action Recognition

Sensors (Basel). 2021 Aug 20;21(16):5613. doi: 10.3390/s21165613.

Abstract

We introduce a generative Bayesian switching dynamical model for action recognition in 3D skeletal data. Our model encodes highly correlated skeletal data into a few sets of low-dimensional switching temporal processes and from there decodes to the motion data and their associated action labels. We parameterize these temporal processes with regard to a switching deep autoregressive prior to accommodate both multimodal and higher-order nonlinear inter-dependencies. This results in a dynamical deep generative latent model that parses meaningful intrinsic states in skeletal dynamics and enables action recognition. These sequences of states provide visual and quantitative interpretations about motion primitives that gave rise to each action class, which have not been explored previously. In contrast to previous works, which often overlook temporal dynamics, our method explicitly model temporal transitions and is generative. Our experiments on two large-scale 3D skeletal datasets substantiate the superior performance of our model in comparison with the state-of-the-art methods. Specifically, our method achieved 6.3% higher action classification accuracy (by incorporating a dynamical generative framework), and 3.5% better predictive error (by employing a nonlinear second-order dynamical transition model) when compared with the best-performing competitors.

Keywords: 3D skeletal motion; bayesian inference; biologically valid interpretation; deep generative models; human action recognition; latent state modeling; motion capture; switching dynamical modeling; variational inference.

MeSH terms

  • Bayes Theorem
  • Humans
  • Motion
  • Nonlinear Dynamics*
  • Pattern Recognition, Automated*