Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis

IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5611-5625. doi: 10.1109/TNNLS.2021.3071119. Epub 2022 Oct 5.

Abstract

Heterogeneous faces are acquired with different sensors, which are closer to real-world scenarios and play an important role in the biometric security field. However, heterogeneous face analysis is still a challenging problem due to the large discrepancy between different modalities. Recent works either focus on designing a novel loss function or network architecture to directly extract modality-invariant features or synthesizing the same modality faces initially to decrease the modality gap. Yet, the former always lacks explicit interpretability, and the latter strategy inherently brings in synthesis bias. In this article, we explore to learn the plain interpretable representation for complex heterogeneous faces and simultaneously perform face recognition and synthesis tasks. We propose the heterogeneous face interpretable disentangled representation (HFIDR) that could explicitly interpret dimensions of face representation rather than simple mapping. Benefited from the interpretable structure, we further could extract latent identity information for cross-modality recognition and convert the modality factor to synthesize cross-modality faces. Moreover, we propose a multimodality heterogeneous face interpretable disentangled representation (M-HFIDR) to extend the basic approach suitable for the multimodality face recognition and synthesis. To evaluate the ability of generalization, we construct a novel large-scale face sketch data set. Experimental results on multiple heterogeneous face databases demonstrate the effectiveness of the proposed method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biometric Identification* / methods
  • Databases, Factual
  • Face / anatomy & histology
  • Facial Recognition*
  • Neural Networks, Computer