Human Motion Prediction via Dual-Attention and Multi-Granularity Temporal Convolutional Networks

Sensors (Basel). 2023 Jun 16;23(12):5653. doi: 10.3390/s23125653.

Abstract

Intelligent devices, which significantly improve the quality of life and work efficiency, are now widely integrated into people's daily lives and work. A precise understanding and analysis of human motion is essential for achieving harmonious coexistence and efficient interaction between intelligent devices and humans. However, existing human motion prediction methods often fail to fully exploit the dynamic spatial correlations and temporal dependencies inherent in motion sequence data, which leads to unsatisfactory prediction results. To address this issue, we proposed a novel human motion prediction method that utilizes dual-attention and multi-granularity temporal convolutional networks (DA-MgTCNs). Firstly, we designed a unique dual-attention (DA) model that combines joint attention and channel attention to extract spatial features from both joint and 3D coordinate dimensions. Next, we designed a multi-granularity temporal convolutional networks (MgTCNs) model with varying receptive fields to flexibly capture complex temporal dependencies. Finally, the experimental results from two benchmark datasets, Human3.6M and CMU-Mocap, demonstrated that our proposed method significantly outperformed other methods in both short-term and long-term prediction, thereby verifying the effectiveness of our algorithm.

Keywords: attention mechanism; human motion prediction; multi-granularity; temporal convolutional networks.

MeSH terms

  • Algorithms*
  • Benchmarking
  • Humans
  • Intelligence
  • Motion
  • Quality of Life*