Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

Pavol Partila; Miroslav Voznak; Jaromir Tovarek

doi:10.1155/2015/573068

Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

ScientificWorldJournal. 2015:2015:573068. doi: 10.1155/2015/573068. Epub 2015 Aug 4.

Authors

Pavol Partila¹, Miroslav Voznak¹, Jaromir Tovarek¹

Affiliation

¹ Department of Telecommunications, Faculty of Electrical Engineering and Computer Science, VSB-Technical University of Ostrava, 17 Listopadu 15, 70833 Ostrava, Czech Republic.

Abstract

The impact of the classification method and features selection for the speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the complexity of system computing. This step is necessary especially for systems that will be deployed in real-time applications. The reason for the development and improvement of speech emotion recognition systems is wide usability in nowadays automatic voice controlled systems. Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture model is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system due to its accuracy and efficiency.

MeSH terms

Algorithms*
Databases, Factual
Emotions / physiology*
Humans
Neural Networks, Computer
Pattern Recognition, Automated*
Pattern Recognition, Physiological / physiology
ROC Curve
Signal Processing, Computer-Assisted / instrumentation*
Speech / physiology*
Voice Quality