Multidirectional regression (MDR)-based features for automatic voice disorder detection

Ghulam Muhammad; Tamer A Mesallam; Khalid H Malki; Mohamed Farahat; Awais Mahmood; Mansour Alsulaiman

doi:10.1016/j.jvoice.2012.05.002

Multidirectional regression (MDR)-based features for automatic voice disorder detection

J Voice. 2012 Nov;26(6):817.e19-27. doi: 10.1016/j.jvoice.2012.05.002.

Authors

Ghulam Muhammad¹, Tamer A Mesallam, Khalid H Malki, Mohamed Farahat, Awais Mahmood, Mansour Alsulaiman

Affiliation

¹ Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia. ghulam@ksu.edu.sa

PMID: 23177748
DOI: 10.1016/j.jvoice.2012.05.002

Abstract

Background and objective: Objective assessment of voice pathology has a growing interest nowadays. Automatic speech/speaker recognition (ASR) systems are commonly deployed in voice pathology detection. The aim of this work was to develop a novel feature extraction method for ASR that incorporates distributions of voiced and unvoiced parts, and voice onset and offset characteristics in a time-frequency domain to detect voice pathology.

Materials and methods: The speech samples of 70 dysphonic patients with six different types of voice disorders and 50 normal subjects were analyzed. The Arabic spoken digits (1-10) were taken as an input. The proposed feature extraction method was embedded into the ASR system with Gaussian mixture model (GMM) classifier to detect voice disorder.

Results: Accuracy of 97.48% was obtained in text independent (all digits' training) case, and over 99% accuracy was obtained in text dependent (separate digit's training) case. The proposed method outperformed the conventional Mel frequency cepstral coefficient (MFCC) features.

Conclusion: The results of this study revealed that incorporating voice onset and offset information leads to efficient automatic voice disordered detection.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Acoustics*
Adolescent
Adult
Algorithms
Automation
Case-Control Studies
Female
Humans
Linear Models
Male
Middle Aged
Models, Statistical*
Pattern Recognition, Automated
Predictive Value of Tests
Signal Processing, Computer-Assisted*
Sound Spectrography
Speech Acoustics*
Speech Production Measurement*
Time Factors
Voice Disorders / diagnosis*
Voice Disorders / physiopathology
Voice Quality*
Young Adult