Perceptual learning for speech in noise after application of binary time-frequency masks

J Acoust Soc Am. 2013 Mar;133(3):1687-92. doi: 10.1121/1.4789896.

Abstract

Ideal time-frequency (TF) masks can reject noise and improve the recognition of speech-noise mixtures. An ideal TF mask is constructed with prior knowledge of the target speech signal. The intelligibility of a processed speech-noise mixture depends upon the threshold criterion used to define the TF mask. The study reported here assessed the effect of training on the recognition of speech in noise after processing by ideal TF masks that did not restore perfect speech intelligibility. Two groups of listeners with normal hearing listened to speech-noise mixtures processed by TF masks calculated with different threshold criteria. For each group, a threshold criterion that initially produced word recognition scores between 0.56-0.69 was chosen for training. Listeners practiced with one set of TF-masked sentences until their word recognition performance approached asymptote. Perceptual learning was quantified by comparing word-recognition scores in the first and last training sessions. Word recognition scores improved with practice for all listeners with the greatest improvement observed for the same materials used in training.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Acoustic Stimulation
  • Adult
  • Audiometry, Speech
  • Auditory Threshold
  • Comprehension
  • Female
  • Humans
  • Learning*
  • Male
  • Noise / adverse effects*
  • Pattern Recognition, Physiological
  • Perceptual Masking*
  • Psychoacoustics
  • Recognition, Psychology
  • Sound Spectrography
  • Speech Intelligibility
  • Speech Perception*
  • Time Factors
  • Young Adult