Gaze Estimation Approach Using Deep Differential Residual Network

Longzhao Huang; Yujie Li; Xu Wang; Haoyu Wang; Ahmed Bouridane; Ahmad Chaddad

doi:10.3390/s22145462

Gaze Estimation Approach Using Deep Differential Residual Network

Sensors (Basel). 2022 Jul 21;22(14):5462. doi: 10.3390/s22145462.

Authors

Longzhao Huang¹, Yujie Li¹, Xu Wang¹, Haoyu Wang¹, Ahmed Bouridane², Ahmad Chaddad^{1

3}

Affiliations

¹ School of Artificial Intelligence, Guilin University of Electronic Technology, Jinji Road, Guilin 541004, China.
² Faculty of Engineering and Environment, Northumbria University, Newcastle NE18ST, UK.
³ The Laboratory for Imagery Vision and Artificial Intelligence, Ecole de Technologie Superieure, 1100 Rue Notre Dame O, Montreal, QC H3C1K3, Canada.

Abstract

Gaze estimation, which is a method to determine where a person is looking at given the person's full face, is a valuable clue for understanding human intention. Similarly to other domains of computer vision, deep learning (DL) methods have gained recognition in the gaze estimation domain. However, there are still gaze calibration problems in the gaze estimation domain, thus preventing existing methods from further improving the performances. An effective solution is to directly predict the difference information of two human eyes, such as the differential network (Diff-Nn). However, this solution results in a loss of accuracy when using only one inference image. We propose a differential residual model (DRNet) combined with a new loss function to make use of the difference information of two eye images. We treat the difference information as auxiliary information. We assess the proposed model (DRNet) mainly using two public datasets (1) MpiiGaze and (2) Eyediap. Considering only the eye features, DRNet outperforms the state-of-the-art gaze estimation methods with angular-error of 4.57 and 6.14 using MpiiGaze and Eyediap datasets, respectively. Furthermore, the experimental results also demonstrate that DRNet is extremely robust to noise images.

Keywords: differential residual network; gaze calibration; gaze estimation; noise image.

MeSH terms

Eye
Eye Movements*
Fixation, Ocular*
Humans

Abstract

MeSH terms

Grants and funding