SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla

PLoS One. 2021 Apr 30;16(4):e0250173. doi: 10.1371/journal.pone.0250173. eCollection 2021.

Abstract

SUBESCO is an audio-only emotional speech corpus for the Bangla language. The corpus contains 7000 utterances with a total duration of more than 7 hours, and it is the largest emotional speech corpus available for this language. Twenty native speakers participated in the gender-balanced set, each recording 10 sentences simulating seven targeted emotions. Fifty university students participated in the evaluation of this corpus. Each audio clip of this corpus, except those of the Disgust emotion, was validated four times by male and female raters. Raw hit rates and unbiased rates were calculated, producing scores above the chance level of responses. The overall recognition rate was reported to be above 70% for human perception tests. Kappa statistics and intra-class correlation coefficient scores indicated a high level of inter-rater reliability and consistency of this corpus evaluation. SUBESCO is an Open Access database, licensed under Creative Commons Attribution 4.0 International, and can be downloaded free of charge from the web link: https://doi.org/10.5281/zenodo.4526477.
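
The abstract does not say how the raw hit rates and unbiased rates were computed. The sketch below illustrates one common approach, assuming that "unbiased rates" refers to Wagner's unbiased hit rate (squared hits divided by the product of the row and column totals of a confusion matrix). The function name `hit_rates`, the three-emotion subset, and the counts are invented for illustration and are not taken from the SUBESCO evaluation.

```python
from typing import List, Tuple


def hit_rates(confusion: List[List[int]]) -> Tuple[List[float], List[float]]:
    """Return (raw, unbiased) hit rates per emotion.

    confusion[i][j] = number of clips with intended emotion i
    that raters labelled as emotion j (square matrix).
    """
    n = len(confusion)
    # Row total: how often emotion k was presented.
    row_totals = [sum(row) for row in confusion]
    # Column total: how often raters chose emotion k as their answer.
    col_totals = [sum(confusion[i][j] for i in range(n)) for j in range(n)]

    raw, unbiased = [], []
    for k in range(n):
        hits = confusion[k][k]
        # Raw hit rate: proportion of presented clips recognised correctly.
        raw.append(hits / row_totals[k] if row_totals[k] else 0.0)
        # Unbiased hit rate (assumed to be Wagner's Hu): penalises
        # emotions that raters choose too often.
        denom = row_totals[k] * col_totals[k]
        unbiased.append(hits * hits / denom if denom else 0.0)
    return raw, unbiased


if __name__ == "__main__":
    # Hypothetical counts for a toy three-emotion subset (not real data).
    emotions = ["anger", "happiness", "sadness"]
    toy = [
        [9, 1, 0],
        [3, 6, 1],
        [2, 0, 8],
    ]
    raw, unbiased = hit_rates(toy)
    for name, r, u in zip(emotions, raw, unbiased):
        print(f"{name}: raw={r:.3f} unbiased={u:.3f}")
```

With seven response options, guessing uniformly at random would give a raw hit rate of 1/7, or about 14%, which is the kind of chance level such scores are compared against.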

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Bangladesh
  • Emotions
  • Female
  • Humans
  • India
  • Language
  • Male
  • Recognition, Psychology
  • Reproducibility of Results
  • Speech / classification*
  • Speech Perception
  • Verbal Behavior

Grants and funding

Authors: Zafar, Sadia
Grant Number: HEQEP AIF Window 4, CP 3888
Funder: Higher Education Quality Enhancement Project for the Development of Multi-Platform Speech and Language Processing Software for Bangla
Url: http://cse.sust.edu/bangla-nlp/

Authors: Shahidur, Sadia
Grant Number: AS/2019/1/18
Funder: SUST Research Center
Url: https://www.sust.edu/centers/research-center

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.