Combining Electrodermal Activity and Speech Analysis towards a more Accurate Emotion Recognition System

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul:2019:229-232. doi: 10.1109/EMBC.2019.8857745.

Abstract

Current research in the emotion recognition field is exploring the possibility of merging information from physiological signals, behavioural data, and speech. Electrodermal activity (EDA) is among the main psychophysiological indicators of arousal. Nonetheless, it is difficult to analyze in ecological scenarios, for instance when the subject is speaking. Speech, on the other hand, carries relevant information about the subject's emotional state, and its potential in the field of affective computing has yet to be fully exploited. In this work, we explore the possibility of merging information from EDA and speech to improve the recognition of human arousal level during the pronunciation of single affective words. Unlike the majority of studies in the literature, we focus on the speaker's arousal rather than on the emotion conveyed by the spoken word. Specifically, a support vector machine with a recursive feature elimination strategy (SVM-RFE) is trained and tested on three datasets, i.e., using the two channels (speech and EDA) separately and then jointly. The results show that merging EDA and speech information significantly improves on the marginal classifier (+11.64%). The six features selected by the RFE procedure will be used for the development of a future multivariate model of emotions.
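The abstract describes the classification setup only at a high level. The sketch below is one plausible way to reproduce the SVM-RFE comparison across the three datasets (speech only, EDA only, and the joint channel) with scikit-learn; the linear kernel, five-fold cross-validation, the retained-feature count of six, and the placeholder matrices `X_speech`, `X_eda`, and `y` are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of an SVM-RFE arousal classifier evaluated on the two
# channels separately and jointly. Feature matrices and labels are random
# placeholders; only the overall structure mirrors the abstract.
import numpy as np
from sklearn.feature_selection import RFE
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
n_words = 200                                # hypothetical number of spoken affective words
X_speech = rng.normal(size=(n_words, 20))    # placeholder speech features
X_eda = rng.normal(size=(n_words, 10))       # placeholder EDA features
y = rng.integers(0, 2, size=n_words)         # binary arousal label (low / high)

datasets = {
    "speech only": X_speech,
    "EDA only": X_eda,
    "speech + EDA": np.hstack([X_speech, X_eda]),  # joint feature-level fusion
}

for name, X in datasets.items():
    # Linear SVM wrapped in recursive feature elimination, keeping 6 features
    # (the number reported as selected in the abstract).
    clf = make_pipeline(
        StandardScaler(),
        RFE(SVC(kernel="linear"), n_features_to_select=6),
    )
    scores = cross_val_score(clf, X, y, cv=5)
    print(f"{name}: accuracy {scores.mean():.3f} +/- {scores.std():.3f}")
```

With real speech and EDA feature sets in place of the random matrices, comparing the joint-channel score against the two single-channel scores would reproduce the kind of marginal-versus-fused comparison the abstract reports.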

MeSH terms

  • Arousal
  • Emotions
  • Galvanic Skin Response*
  • Humans
  • Speech*
  • Support Vector Machine