A study on expression recognition based on improved mobilenetV2 network

Qiming Zhu; Hongwei Zhuang; Mi Zhao; Shuangchao Xu; Rui Meng

doi:10.1038/s41598-024-58736-x

Sci Rep. 2024 Apr 7;14(1):8121. doi: 10.1038/s41598-024-58736-x.

Authors

Qiming Zhu^#¹, Hongwei Zhuang^#², Mi Zhao³, Shuangchao Xu¹, Rui Meng⁴

Affiliations

¹ College of Equipment Support and Management, Engineering University of PAP, Xi'an, 710086, China.
² College of Equipment Support and Management, Engineering University of PAP, Xi'an, 710086, China. zhuanghw01@163.com.
³ Basic Education, Engineering University of PAP, Xi'an, 710086, China.
⁴ College of Military Basic Education, Engineering University of PAP, Xi'an, 710086, China.

^# Contributed equally.

Abstract

This paper proposes an improved MobileNetV2 neural network (I-MobileNetV2) to address two problems: the large parameter counts of existing deep convolutional neural networks, and the shortcomings of the lightweight MobileNetV2 in facial emotion recognition tasks, namely easy loss of feature information, poor real-time performance, and low accuracy. The network inherits MobileNetV2's depthwise separable convolution, which reduces the computational load while keeping the model lightweight. It uses a reverse fusion mechanism to retain negative features, making information less likely to be lost. The SELU activation function replaces ReLU6 to avoid vanishing gradients. To improve feature recognition capability, a channel attention mechanism (Squeeze-and-Excitation Networks, SE-Net) is integrated into the MobileNetV2 network. Experiments on the facial expression datasets FER2013 and CK+ showed that the proposed model achieved recognition accuracies of 68.62% and 95.96%, improving on the MobileNetV2 model by 0.72% and 6.14% respectively, while the parameter count decreased by 83.8%. These results empirically verify the effectiveness of the improvements made to the network model.
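The building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' I-MobileNetV2: the abstract does not give layer sizes, the SE reduction ratio, or the details of the reverse fusion mechanism, so the function names, the toy weights, and the example shapes below are all assumptions chosen for illustration. The sketch shows only the generic forms of ReLU6 and SELU (SELU keeps a nonzero response for negative inputs, whereas ReLU6 zeroes them), the weight count of a standard versus a depthwise separable convolution, and an SE-style channel gate.

```python
import math

# Standard SELU constants (Klambauer et al.); not specific to this paper.
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def relu6(x):
    """ReLU6: clamps the input to [0, 6], so negative inputs become 0."""
    return min(max(x, 0.0), 6.0)


def selu(x):
    """SELU: scaled identity for x > 0, scaled exponential curve for x <= 0."""
    return SELU_SCALE * (x if x > 0 else SELU_ALPHA * (math.exp(x) - 1.0))


def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (biases ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(k, c_in, c_out):
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def se_gates(feature_maps, w_reduce, w_expand):
    """SE-style channel gates.

    feature_maps: list of C channels, each a list of rows of floats.
    w_reduce:     hypothetical (C/r) x C weight matrix of the squeeze FC layer.
    w_expand:     hypothetical C x (C/r) weight matrix of the excitation FC layer.
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in feature_maps
    ]
    # Excitation: FC -> ReLU -> FC -> sigmoid, one gate in (0, 1) per channel.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w_reduce
    ]
    return [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w_expand
    ]


def se_scale(feature_maps, w_reduce, w_expand):
    """Rescale every channel by its SE gate."""
    gates = se_gates(feature_maps, w_reduce, w_expand)
    return [
        [[v * g for v in row] for row in ch]
        for ch, g in zip(feature_maps, gates)
    ]
```

For an assumed 3 x 3 layer with 32 input and 64 output channels, `conv_params(3, 32, 64)` gives 18432 weights and `separable_conv_params(3, 32, 64)` gives 2336, which illustrates why the depthwise separable design is lightweight; these toy figures are unrelated to the 83.8% reduction reported in the abstract.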

Keywords: Attention mechanism; Expression recognition; MobileNetV2; Reverse fusion; SELU.

MeSH terms

Accidental Injuries*
Facial Recognition*
Humans
Neural Networks, Computer
Recognition, Psychology