On the compromise between noise reduction and speech/noise spatial information preservation in binaural speech enhancement

J Acoust Soc Am. 2021 May;149(5):3151. doi: 10.1121/10.0004854.

Abstract

Spatial information is important for human perception of speech and sound signals. However, this information is often either distorted or completely neglected in noise reduction because it is challenging, to say the least, to achieve optimal noise reduction and accurate spatial information preservation at the same time. This paper studies the problem of binaural speech enhancement. By jointly diagonalizing the speech and noise correlation matrices, we present a method to construct the noise reduction filter as a linear combination of different eigenvectors, which span a certain subspace of the entire space. A different dimension of the subspace gives a different trade-off between noise reduction and speech/noise spatial information preservation. On the one side, if the dimension is equal to 1, maximum noise reduction is achieved but at the price of significant spatial information distortion. On the other extreme, if the dimension of the subspace is equal to that of the entire space, spatial information is accurately preserved but at the cost of no noise reduction. Therefore, one can achieve different levels of compromises between the amount of noise reduction and the level of speech/noise spatial information preservation by adjusting the dimension of the used subspace.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Noise / adverse effects
  • Sound
  • Sound Localization*
  • Speech
  • Speech Perception*