Real-Time Multi-Scale Face Detector on Embedded Devices

Xu Zhao; Xiaoqing Liang; Chaoyang Zhao; Ming Tang; Jinqiao Wang

doi:10.3390/s19092158

Sensors (Basel). 2019 May 9;19(9):2158. doi: 10.3390/s19092158.

Authors

Xu Zhao^{1,2}, Xiaoqing Liang^{3,4}, Chaoyang Zhao^{5,6}, Ming Tang^{7,8}, Jinqiao Wang^{9,10}

Affiliations

¹ National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. xu.zhao@nlpr.ia.ac.cn.
² University of Chinese Academy of Sciences, Beijing 100049, China. xu.zhao@nlpr.ia.ac.cn.
³ National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. xiaoqing.liang@nlpr.ia.ac.cn.
⁴ University of Chinese Academy of Sciences, Beijing 100049, China. xiaoqing.liang@nlpr.ia.ac.cn.
⁵ National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. chaoyang.zhao@nlpr.ia.ac.cn.
⁶ University of Chinese Academy of Sciences, Beijing 100049, China. chaoyang.zhao@nlpr.ia.ac.cn.
⁷ National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. tangm@nlpr.ia.ac.cn.
⁸ University of Chinese Academy of Sciences, Beijing 100049, China. tangm@nlpr.ia.ac.cn.
⁹ National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. jqwang@nlpr.ia.ac.cn.
¹⁰ University of Chinese Academy of Sciences, Beijing 100049, China. jqwang@nlpr.ia.ac.cn.

Abstract

Face detection is a basic step in video face analysis and has been studied for many years. However, achieving real-time performance on embedded devices with limited computational resources remains an open challenge. To address this problem, we propose a face detector, EagleEye, which achieves a good trade-off between accuracy and speed on popular embedded devices with low computation power (e.g., the Raspberry Pi 3b+). EagleEye is designed to require few floating-point operations (FLOPs) while retaining enough capacity, and its accuracy is further improved without adding many FLOPs. Specifically, we design five strategies for building efficient face detectors with a good balance of accuracy and running speed. The first two strategies help to build a detector with low computational complexity and enough capacity: we use convolution factorization to replace traditional convolutions with sparser depth-wise convolutions, which saves computation, and we use successive downsampling convolutions at the beginning of the face detection network. The remaining three strategies significantly improve the accuracy of the lightweight detector without adding much computation cost. We design an efficient context module that uses context information to benefit face detection. We also adopt an information-preserving activation function to increase the network capacity. Finally, we use focal loss to further improve accuracy by better handling the class imbalance problem. Experiments show that EagleEye outperforms other face detectors with the same order of computation cost in both runtime efficiency and accuracy.
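
As a rough illustration of two of the strategies named above, the following Python sketch compares the multiply-accumulate count of a standard convolution with that of a factorized one, and evaluates a binary focal loss for a single prediction. It is not the authors' implementation. The abstract does not say how the factorization is arranged, so the sketch assumes the common form of a depth-wise convolution followed by a 1 x 1 point-wise convolution. The function names, the example layer size, and the focal loss parameters (alpha = 0.25, gamma = 2.0, the defaults commonly used with focal loss) are likewise assumptions rather than values taken from the paper.

```python
import math


def standard_conv_macs(h, w, c_in, c_out, k):
    """Multiply-accumulates of a k x k standard convolution with an h x w output map."""
    return h * w * c_in * c_out * k * k


def separable_conv_macs(h, w, c_in, c_out, k):
    """Multiply-accumulates of a k x k depth-wise convolution plus a 1 x 1 point-wise convolution.

    Assumption: this depth-wise + point-wise split is one common form of
    convolution factorization; the abstract does not give the exact layout.
    """
    depthwise = h * w * c_in * k * k      # one k x k filter per input channel
    pointwise = h * w * c_in * c_out      # 1 x 1 convolution mixes the channels
    return depthwise + pointwise


def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """Binary focal loss for one predicted face probability p in (0, 1) and label y in {0, 1}.

    Assumption: alpha and gamma are illustrative defaults, not the paper's settings.
    """
    p_t = p if y == 1 else 1.0 - p            # probability assigned to the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # The (1 - p_t) ** gamma factor shrinks the loss of well-classified examples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    # Hypothetical layer: 80 x 80 output map, 64 input and 64 output channels, 3 x 3 kernel.
    std = standard_conv_macs(80, 80, 64, 64, 3)
    sep = separable_conv_macs(80, 80, 64, 64, 3)
    print(f"standard: {std}  factorized: {sep}  ratio: {sep / std:.3f}")

    # An easy background example contributes far less loss than a hard one.
    print(f"easy negative (p=0.01): {focal_loss(0.01, 0):.6f}")
    print(f"hard negative (p=0.90): {focal_loss(0.90, 0):.6f}")
```

Under these assumptions the cost ratio of the factorized layer to the standard layer is 1/c_out + 1/k^2, which is why the saving grows with the number of output channels. The focal loss term shows how the many easy background windows in a face detector are down-weighted relative to the few hard examples.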

Keywords: ARM-based devices; computer vision; face detection; model acceleration.

MeSH terms

Algorithms
Face / physiology*
Humans
Image Processing, Computer-Assisted
Pattern Recognition, Automated / methods
