A Robust Facial Expression Recognition Algorithm Based on Multi-Rate Feature Fusion Scheme

Sensors (Basel). 2021 Oct 20;21(21):6954. doi: 10.3390/s21216954.

Abstract

In recent years, the importance of catching humans' emotions grows larger as the artificial intelligence (AI) field is being developed. Facial expression recognition (FER) is a part of understanding the emotion of humans through facial expressions. We proposed a robust multi-depth network that can efficiently classify the facial expression through feeding various and reinforced features. We designed the inputs for the multi-depth network as minimum overlapped frames so as to provide more spatio-temporal information to the designed multi-depth network. To utilize a structure of a multi-depth network, a multirate-based 3D convolutional neural network (CNN) based on a multirate signal processing scheme was suggested. In addition, we made the input images to be normalized adaptively based on the intensity of the given image and reinforced the output features from all depth networks by the self-attention module. Then, we concatenated the reinforced features and classified the expression by a joint fusion classifier. Through the proposed algorithm, for the CK+ database, the result of the proposed scheme showed a comparable accuracy of 96.23%. For the MMI and the GEMEP-FERA databases, it outperformed other state-of-the-art models with accuracies of 96.69% and 99.79%. For the AFEW database, which is known as one in a very wild environment, the proposed algorithm achieved an accuracy of 31.02%.

Keywords: 3D convolutional neural network (3D CNN); deep learning; facial expression recognition (FER); minimum overlapped frame structure; multi-depth network; multirate signal processing; self-attention.

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Facial Expression
  • Facial Recognition*
  • Humans
  • Neural Networks, Computer