Lightweight Semantic-Guided Neural Networks Based on Single Head Attention for Action Recognition

Sensors (Basel). 2022 Nov 28;22(23):9249. doi: 10.3390/s22239249.

Abstract

Skeleton-based action recognition can achieve a relatively high performance by transforming the human skeleton structure in an image into a graph and applying action recognition based on structural changes in the body. Among the many graph convolutional network (GCN) approaches used in skeleton-based action recognition, semantic-guided neural networks (SGNs) are fast action recognition algorithms that hierarchically learn spatial and temporal features by applying a GCN. However, because an SGN focuses on global feature learning rather than local feature learning owing to the structural characteristics, there is a limit to an action recognition in which the dependency between neighbouring nodes is important. To solve these problems and simultaneously achieve a real-time action recognition in low-end devices, in this study, a single head attention (SHA) that can overcome the limitations of an SGN is proposed, and a new SGN-SHA model that combines SHA with an SGN is presented. In experiments on various action recognition benchmark datasets, the proposed SGN-SHA model significantly reduced the computational complexity while exhibiting a performance similar to that of an existing SGN and other state-of-the-art methods.

Keywords: action recognition; graph convolutional networks; semantic-guided neural networks; single head attention; skeletal structure.

MeSH terms

  • Algorithms
  • Benchmarking
  • Humans
  • Learning*
  • Neural Networks, Computer
  • Recognition, Psychology*