Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

ScientificWorldJournal. 2015:2015:573068. doi: 10.1155/2015/573068. Epub 2015 Aug 4.

Abstract

This paper discusses the impact of the classification method and feature selection on speech emotion recognition accuracy. Selecting the right parameters in combination with the classifier is an important part of reducing the computational complexity of the system. This step is especially necessary for systems that will be deployed in real-time applications. Speech emotion recognition systems are being developed and improved because of their wide usability in today's automatic voice-controlled systems. The Berlin database of emotional recordings was used in this experiment. The classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture models is measured with respect to the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of classification method and feature group for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system with regard to its accuracy and efficiency.
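As a rough illustration of the kind of comparison the abstract describes, the sketch below scores a plain k-nearest-neighbours classifier on different feature groups. It is not the authors' implementation and does not use the Berlin database: the data are synthetic, the two-class labels (neutral vs. stressed), the column layout, the group names in `FEATURE_GROUPS`, and the choice of k = 3 are all assumptions made for the example.

```python
import random
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k training vectors closest to x."""
    # Squared Euclidean distance preserves the neighbour ordering.
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def accuracy(train_X, train_y, test_X, test_y, columns, k=3):
    """Accuracy of k-NN when only the given feature columns are used."""
    def pick(row):
        return [row[c] for c in columns]

    train_sub = [pick(row) for row in train_X]
    hits = sum(
        knn_predict(train_sub, train_y, pick(x), k) == y
        for x, y in zip(test_X, test_y)
    )
    return hits / len(test_y)


def make_toy_data(n, rng):
    """Synthetic stand-in for extracted speech features (assumption)."""
    X, y = [], []
    for i in range(n):
        label = i % 2  # 0 = neutral, 1 = stressed (hypothetical labels)
        X.append([
            rng.gauss(3.0 * label, 1.0),  # column 0: class-dependent
            rng.gauss(3.0 * label, 1.0),  # column 1: class-dependent
            rng.gauss(0.0, 1.0),          # column 2: pure noise
            rng.gauss(0.0, 1.0),          # column 3: pure noise
        ])
        y.append(label)
    return X, y


# Hypothetical grouping of columns; in this toy data only the first
# group carries class information.
FEATURE_GROUPS = {
    "prosodic": [0, 1],
    "spectral": [2, 3],
    "all": [0, 1, 2, 3],
}

rng = random.Random(0)  # fixed seed so the run is repeatable
train_X, train_y = make_toy_data(200, rng)
test_X, test_y = make_toy_data(100, rng)

results = {
    name: accuracy(train_X, train_y, test_X, test_y, cols)
    for name, cols in FEATURE_GROUPS.items()
}

for name, acc in results.items():
    print(f"{name}: {acc:.2f}")
```

Running the same loop with other classifiers in place of `knn_predict` would give a method-by-feature-group accuracy table, and restricting the columns also reduces the per-query distance computation, which is the complexity argument made in the abstract.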

MeSH terms

  • Algorithms*
  • Databases, Factual
  • Emotions / physiology*
  • Humans
  • Neural Networks, Computer
  • Pattern Recognition, Automated*
  • Pattern Recognition, Physiological / physiology
  • ROC Curve
  • Signal Processing, Computer-Assisted / instrumentation*
  • Speech / physiology*
  • Voice Quality