Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants

Trends Hear. 2015 Dec 30:19:2331216515616941. doi: 10.1177/2331216515616941.

Abstract

Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improvement of word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed in the eight CI subjects. Additionally, a considerable reduction of the spread of speech perception performance from 40% to 93% for advanced combination encoder to 80% to 100% for the suggested NMF coding strategy was observed.

Keywords: babble noise; cochlear implant; nonnegative matrix factorization; speech enhancement; speech recognition; vocoder.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustic Stimulation / methods
  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms*
  • Cochlear Implantation / methods
  • Cochlear Implants*
  • Cohort Studies
  • Female
  • Humans
  • Male
  • Middle Aged
  • Prosthesis Design
  • Signal Processing, Computer-Assisted*
  • Signal-To-Noise Ratio
  • Sound Spectrography / methods
  • Speech Perception / physiology*
  • Speech Recognition Software*