N-Step Pre-Training and Décalcomanie Data Augmentation for Micro-Expression Recognition

Sensors (Basel). 2022 Sep 3;22(17):6671. doi: 10.3390/s22176671.

Abstract

Facial expressions are divided into micro- and macro-expressions. Micro-expressions are low-intensity emotions presented for a short moment of about 0.25 s, whereas macro-expressions last up to 4 s. To derive micro-expressions, participants are asked to suppress their emotions as much as possible while watching emotion-inducing videos. However, it is a challenging process, and the number of samples collected tends to be less than those of macro-expressions. Because training models with insufficient data may lead to decreased performance, this study proposes two ways to solve the problem of insufficient data for micro-expression training. The first method involves N-step pre-training, which performs multiple transfer learning from action recognition datasets to those in the facial domain. Second, we propose Décalcomanie data augmentation, which is based on facial symmetry, to create a composite image by cutting and pasting both faces around their center lines. The results show that the proposed methods can successfully overcome the data shortage problem and achieve high performance.

Keywords: convolutional neural network (CNN); deep learning; emotion recognition; facial micro-expression; image processing.

MeSH terms

  • Emotions
  • Facial Expression*
  • Humans
  • Recognition, Psychology*
  • Videotape Recording