Reference-Free Assessment of Speech Intelligibility Using Bispectrum of an Auditory Neurogram

PLoS One. 2016 Mar 11;11(3):e0150415. doi: 10.1371/journal.pone.0150415. eCollection 2016.

Abstract

Sensorineural hearing loss occurs due to damage to the inner and outer hair cells of the peripheral auditory system. Hearing loss can cause decreases in audibility, dynamic range, frequency and temporal resolution of the auditory system, and all of these effects are known to affect speech intelligibility. In this study, a new reference-free speech intelligibility metric is proposed using 2-D neurograms constructed from the output of a computational model of the auditory periphery. The responses of the auditory-nerve fibers with a wide range of characteristic frequencies were simulated to construct neurograms. The features of the neurograms were extracted using third-order statistics referred to as bispectrum. The phase coupling of neurogram bispectrum provides a unique insight for the presence (or deficit) of supra-threshold nonlinearities beyond audibility for listeners with normal hearing (or hearing loss). The speech intelligibility scores predicted by the proposed method were compared to the behavioral scores for listeners with normal hearing and hearing loss both in quiet and under noisy background conditions. The results were also compared to the performance of some existing methods. The predicted results showed a good fit with a small error suggesting that the subjective scores can be estimated reliably using the proposed neural-response-based metric. The proposed metric also had a wide dynamic range, and the predicted scores were well-separated as a function of hearing loss. The proposed metric successfully captures the effects of hearing loss and supra-threshold nonlinearities on speech intelligibility. This metric could be applied to evaluate the performance of various speech-processing algorithms designed for hearing aids and cochlear implants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Audiometry, Speech / methods*
  • Auditory Threshold
  • Cochlear Nerve / physiopathology
  • Hearing Loss, Sensorineural / physiopathology*
  • Humans
  • Models, Theoretical
  • Speech Intelligibility*
  • Speech Perception

Grants and funding

This study was supported by the grants UM.C/625/1/HIR/152 and RP016B-13AET. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.