Autocorrelation factors and intelligibility of Japanese monosyllables in individuals with sensorineural hearing loss

J Acoust Soc Am. 2017 Feb;141(2):1065. doi: 10.1121/1.4976064.

Abstract

Some Japanese monosyllables contain consonants that are not easily discernible for individuals with sensorineural hearing loss. However, the acoustic features that make these monosyllables difficult to discern have not been clearly identified. Here, this study used the autocorrelation function (ACF), which can capture temporal features of signals, to clarify the factors influencing speech intelligibility. For each monosyllable, five factors extracted from the ACF [Φ(0): total energy; τ1 and ϕ1: delay time and amplitude of the maximum peak; τe: effective duration; Wϕ(0): spectral centroid], voice onset time, speech intelligibility index, and loudness level were compared with the percentage of correctly perceived articulations (144 ears) obtained by 50 Japanese vowel and consonant-vowel monosyllables produced by one female speaker. Results showed that median effective duration [(τe)med] was strongly correlated with the percentage of correctly perceived articulations of the consonants (r = 0.87, p < 0.01). (τe)med values were computed by running ACFs with the time lag at which the magnitude of the logarithmic-ACF envelope had decayed to -10 dB. Effective duration is a measure of temporal pattern persistence, i.e., the duration over which the waveform maintains a stable pattern. The authors postulate that low recognition ability is related to degraded perception of temporal fluctuation patterns.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustic Stimulation
  • Audiometry, Speech
  • Cues
  • Female
  • Hearing Loss, Sensorineural / diagnosis
  • Hearing Loss, Sensorineural / physiopathology
  • Hearing Loss, Sensorineural / psychology*
  • Humans
  • Japan
  • Language*
  • Male
  • Persons With Hearing Impairments / psychology*
  • Speech Intelligibility*
  • Speech Perception*
  • Time Factors
  • Voice Quality*