DSPose: Dual-Space-Driven Keypoint Topology Modeling for Human Pose Estimation

Sensors (Basel). 2023 Sep 3;23(17):7626. doi: 10.3390/s23177626.

Abstract

Human pose estimation is the basis of many downstream tasks, such as motor intervention, behavior understanding, and human-computer interaction. The existing human pose estimation methods rely too much on the similarity of keypoints at the image feature level, which is vulnerable to three problems: object occlusion, keypoints ghost, and neighbor pose interference. We propose a dual-space-driven topology model for the human pose estimation task. Firstly, the model extracts relatively accurate keypoints features through a Transformer-based feature extraction method. Then, the correlation of keypoints in the physical space is introduced to alleviate the error localization problem caused by excessive dependence on the feature-level representation of the model. Finally, through the graph convolutional neural network, the spatial correlation of keypoints and the feature correlation are effectively fused to obtain more accurate human pose estimation results. The experimental results on real datasets also further verify the effectiveness of our proposed model.

Keywords: Transformer; dual space; graph convolutional network; human pose estimation; keypoint detection.

MeSH terms

  • Electric Power Supplies*
  • Humans
  • Neural Networks, Computer*

Grants and funding

This work is supported by the Fundamental Research Funds for the Central Universities, HUST: 2022WKYXZX019, the Humanity and Social Science Youth Foundation of Ministry of Education of China (No. 22YJC890005), Hubei Province Social Science Fund General Project (subsequent funding) (No. HBSK2022YB562), the Hubei Natural Science Foundation Youth Project (No. 2023AFB359), and the Fundamental Research Funds for the Central Universities, CUG: G1323522067.