Two-Stream Attention Network for Pain Recognition from Video Sequences

Sensors (Basel). 2020 Feb 4;20(3):839. doi: 10.3390/s20030839.

Abstract

Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial descriptors and simultaneous optimisation of a classification architecture. In the current work, an end-to-end approach based on attention networks for the analysis and recognition of pain-related facial expressions is proposed. The method combines both spatial and temporal aspects of facial expressions through a weighted aggregation of attention-based neural networks' outputs, based on sequences of Motion History Images (MHIs) and Optical Flow Images (OFIs). Each input stream is fed into a specific attention network consisting of a Convolutional Neural Network (CNN) coupled to a Bidirectional Long Short-Term Memory (BiLSTM) Recurrent Neural Network (RNN). An attention mechanism generates a single weighted representation of each input stream (MHI sequence and OFI sequence), which is subsequently used to perform specific classification tasks. Simultaneously, a weighted aggregation of the classification scores specific to each input stream is performed to generate a final classification output. The assessment conducted on both the BioVid Heat Pain Database (Part A) and SenseEmotion Database points at the relevance of the proposed approach, as its classification performance is on par with state-of-the-art classification approaches proposed in the literature.

Keywords: convolutional neural networks, long short-term memory recurrent neural networks, information fusion, pain recognition.

MeSH terms

  • Algorithms
  • Attention
  • Calibration
  • Databases, Factual
  • Diagnosis, Computer-Assisted / methods
  • Face
  • Facial Expression
  • Healthy Volunteers
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Machine Learning
  • Memory, Short-Term
  • Models, Statistical
  • Neural Networks, Computer
  • Pain
  • Pain Measurement / methods*
  • Pattern Recognition, Automated*
  • Reproducibility of Results
  • Temperature
  • Video Recording*