Unmasking the Devil in the Details: What Works for Deep Facial Action Coding?

Koichiro Niinuma; Laszlo A Jeni; Itir Onal Ertugrul; Jeffrey F Cohn

Unmasking the Devil in the Details: What Works for Deep Facial Action Coding?

BMVC. 2019 Sep:2019:4.

Authors

Koichiro Niinuma¹, Laszlo A Jeni², Itir Onal Ertugrul², Jeffrey F Cohn³

Affiliations

¹ Fujitsu Laboratories of America Pittsburgh, PA, USA.
² Robotics Institute Carnegie Mellon University Pittsburgh, PA, USA.
³ Department of Psychology University of Pittsburgh Pittsburgh, PA, USA.

PMID: 32510058
PMCID: PMC7274256

Abstract

The performance of automated facial expression coding has improving steadily as evidenced by results of the latest Facial Expression Recognition and Analysis (FERA 2017) Challenge. Advances in deep learning techniques have been key to this success. Yet the contribution of critical design choices remains largely unknown. Using the FERA 2017 database, we systematically evaluated design choices in pre-training, feature alignment, model size selection, and optimizer details. Our findings vary from the counter-intuitive (e.g., generic pre-training outperformed face-specific models) to best practices in tuning optimizers. Informed by what we found, we developed an architecture that exceeded state-of-the-art on FERA 2017. We achieved a 3.5% increase in F₁ score for occurrence detection and a 5.8% increase in ICC for intensity estimation.

Abstract

Grants and funding