Application of poincare-mapping of voiced-speech segments for emotion sensing

Krzysztof Slot; Lukasz Bronakowski; Jaroslaw Cichosz; Hyongsuk Kim

doi:10.3390/s91209858

Sensors (Basel). 2009;9(12):9858-72. doi: 10.3390/s91209858. Epub 2009 Dec 3.

Authors

Krzysztof Slot¹, Lukasz Bronakowski, Jaroslaw Cichosz, Hyongsuk Kim

Affiliation

¹ Institute of Electronics, Technical University of Lodz, Poland, Wolczanska 213/215, 90-924 Lodz, Poland; E-Mails: kslot@p.lodz.pl (K.S.); lukasz.bronakowski@p.lodz.pl (L.B.); jarekcichosz@poczta.onet.pl (J.C.).

Abstract

This paper introduces a group of novel speech-signal descriptors that reflect phoneme-pronunciation variability and can be considered potentially useful features for emotion sensing. The proposed group comprises a set of statistical parameters of Poincare maps derived for the formant-frequency evolution and energy evolution of voiced-speech segments. Two groups of Poincare-map characteristics were considered: descriptors of sample scatter, which reflect the magnitudes of phone-uttering variations, and descriptors of the cross-correlations that exist among samples, which evaluate the consistency of those variations. Including the proposed characteristics in the pool of commonly used speech descriptors is shown to yield a noticeable increase, at the level of 10%, in emotion sensing performance. A standard pattern recognition methodology was adopted to evaluate the proposed descriptors, under the assumption that three- or four-dimensional feature spaces can provide sufficient emotion sensing. Binary decision trees were selected for data classification, as they provide detailed information on the emotion-specific discriminative power of various speech descriptors.
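To make the two descriptor groups concrete, the following is a minimal sketch of how scatter and cross-correlation statistics might be computed from a Poincare map of a per-frame parameter track (for example, a formant frequency or frame energy). The abstract does not give the exact definitions used in the paper, so everything here is an assumption for illustration: the lag-1 pairing of consecutive samples, the function names `poincare_points` and `poincare_descriptors`, and the choice of scatter measures (standard deviations across and along the identity line) together with the Pearson correlation of the point cloud.

```python
import math


def poincare_points(series, lag=1):
    """Pair each sample with its successor: (x[n], x[n + lag])."""
    return list(zip(series[:-lag], series[lag:]))


def poincare_descriptors(series, lag=1):
    """Return illustrative scatter and correlation statistics of a Poincare map.

    These definitions are assumptions, not the paper's own:
      sd_across - spread perpendicular to the identity line (frame-to-frame change)
      sd_along  - spread along the identity line (longer-term change)
      corr      - Pearson correlation between x[n] and x[n + lag]
    """
    pts = poincare_points(series, lag)
    if len(pts) < 2:
        raise ValueError("need at least lag + 2 samples")

    n = len(pts)
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Population variances and covariance of the point cloud.
    var_x = sum((x - mean_x) ** 2 for x in xs) / n
    var_y = sum((y - mean_y) ** 2 for y in ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n

    # Variance of (y - x)/sqrt(2) and (y + x)/sqrt(2); clamp rounding noise at 0.
    sd_across = math.sqrt(max(0.0, (var_x + var_y - 2.0 * cov) / 2.0))
    sd_along = math.sqrt(max(0.0, (var_x + var_y + 2.0 * cov) / 2.0))

    # Correlation is undefined for a constant track; report 0.0 in that case.
    if var_x > 0.0 and var_y > 0.0:
        corr = cov / math.sqrt(var_x * var_y)
    else:
        corr = 0.0

    return {"sd_across": sd_across, "sd_along": sd_along, "corr": corr}


if __name__ == "__main__":
    # Hypothetical first-formant track in Hz for one voiced segment.
    f1_track = [612.0, 618.0, 609.0, 621.0, 615.0, 624.0, 611.0]
    print(poincare_descriptors(f1_track))
```

In this sketch, a steadily drifting track gives a correlation near +1 with almost no spread across the identity line, whereas a track that jitters from frame to frame gives a large across-line spread and a low or negative correlation. Values of this kind, computed per voiced segment, could then be added to a feature vector alongside conventional speech descriptors before classification.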

Keywords: Poincare-maps; decision-trees; emotion sensing; feature selection.