Combining Supervised and Unsupervised Learning Algorithms for Human Activity Recognition

Sensors (Basel). 2021 Sep 21;21(18):6309. doi: 10.3390/s21186309.

Abstract

Human activity recognition is an extensively researched topic in the last decade. Recent methods employ supervised and unsupervised deep learning techniques in which spatial and temporal dependency is modeled. This paper proposes a novel approach for human activity recognition using skeleton data. The method combines supervised and unsupervised learning algorithms in order to provide qualitative results and performance in real time. The proposed method involves a two-stage framework: the first stage applies an unsupervised clustering technique to group up activities based on their similarity, while the second stage classifies data assigned to each group using graph convolutional networks. Different clustering techniques and data augmentation strategies are explored for improving the training process. The results were compared against the state of the art methods and the proposed model achieved 90.22% Top-1 accuracy performance for NTU-RGB+D dataset (the performance was increased by approximately 9% compared with the baseline graph convolutional method). Moreover, inference time and total number of parameters stay within the same magnitude order. Extending the initial set of activities with additional classes is fast and robust, since there is no required retraining of the entire architecture but only to retrain the cluster to which the activity is assigned.

Keywords: Gaussian mixture model; clustering; human activity recognition; k-means; skeleton; spatial-temporal graph convolutional network.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Human Activities
  • Humans
  • Neural Networks, Computer*
  • Unsupervised Machine Learning*