Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening

Eur Arch Otorhinolaryngol. 2015 Nov;272(11):3391-9. doi: 10.1007/s00405-015-3708-4. Epub 2015 Jul 11.

Abstract

The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73-1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7% and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5%. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60% and RFC revealed the EER of 7.9%, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.

Keywords: Acoustic analysis; Smart phone; Voice screening.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Case-Control Studies
  • Discriminant Analysis
  • Feasibility Studies
  • Female
  • Humans
  • Male
  • Middle Aged
  • Reproducibility of Results
  • Smartphone*
  • Voice Disorders / diagnosis*
  • Voice Quality