Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

Chanjun Chun; Kwang Myung Jeon; Wooyeol Choi

doi:10.3390/s20133768

Sensors (Basel). 2020 Jul 5;20(13):3768. doi: 10.3390/s20133768.

Authors

Chanjun Chun¹, Kwang Myung Jeon², Wooyeol Choi³

Affiliations

¹ Infrastructure Research Center, Korea Institute of Civil Engineering and Building Technology, Goyang 10223, Korea.
² IntFlow Co., Ltd, Gwangju 61080, Korea.
³ Department of Computer Engineering, Chosun University, Gwangju 61452, Korea.

Abstract

Deep neural networks (DNNs) have achieved significant advancements in speech processing, and numerous types of DNN architectures have been proposed in the field of sound localization. When a DNN model is deployed for sound localization, a fixed input size is required. This size is generally determined by the number of microphones, the fast Fourier transform size, and the frame size. If the number or configuration of the microphones changes, the DNN model must be retrained because the size of the input features changes. In this paper, we propose a configuration-invariant sound localization technique using the azimuth-frequency representation and convolutional neural networks (CNNs). The proposed CNN model receives the azimuth-frequency representation, instead of time-frequency features, as its input. The proposed model was evaluated with microphone configurations different from the one on which it was originally trained. For evaluation, a single sound source was simulated using the image method. The evaluations confirmed that its localization performance was superior to that of the conventional steered response power phase transform (SRP-PHAT) and multiple signal classification (MUSIC) methods.

Keywords: azimuth-frequency representation; configuration-invariant; convolutional neural network (CNN); sound localization.
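
The abstract does not spell out how the azimuth-frequency representation is computed, so the following is only a minimal sketch under stated assumptions: a far-field, free-field plane wave, a planar array, and a PHAT-weighted steered response evaluated per frequency bin. The names azimuth_frequency_map, circular_array, far_field_delays, and simulate_plane_wave are hypothetical and not taken from the paper. The sketch illustrates why such a feature can be configuration-invariant: its shape depends on the azimuth grid and the FFT size, not on the number of microphones.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def circular_array(n_mics, radius):
    """Hypothetical helper: (n_mics, 2) positions of a uniform circular array, in metres."""
    angles = 2.0 * np.pi * np.arange(n_mics) / n_mics
    return radius * np.stack([np.cos(angles), np.sin(angles)], axis=1)


def far_field_delays(mic_xy, azimuths_rad):
    """Relative arrival delays, shape (n_azimuth, n_mics), of a plane wave per azimuth."""
    directions = np.stack([np.cos(azimuths_rad), np.sin(azimuths_rad)], axis=1)
    # A microphone displaced towards the source hears the wavefront earlier.
    return -(directions @ mic_xy.T) / SPEED_OF_SOUND


def simulate_plane_wave(source_spectrum, mic_xy, azimuth_deg, fs, n_fft):
    """Hypothetical helper: one-sided spectra (n_mics, n_freq) of an anechoic plane wave."""
    freqs = np.arange(n_fft // 2 + 1) * fs / n_fft
    delays = far_field_delays(mic_xy, np.deg2rad([azimuth_deg]))[0]
    return source_spectrum[None, :] * np.exp(-2j * np.pi * delays[:, None] * freqs[None, :])


def azimuth_frequency_map(frame_spectra, mic_xy, fs, n_fft, n_azimuth=72):
    """Assumed feature: PHAT-weighted steered power per (azimuth, frequency bin).

    frame_spectra: complex array (n_mics, n_fft // 2 + 1) for one frame.
    Returns a real array of shape (n_azimuth, n_fft // 2 + 1), whatever n_mics is.
    """
    freqs = np.arange(n_fft // 2 + 1) * fs / n_fft
    azimuths = np.deg2rad(np.arange(n_azimuth) * 360.0 / n_azimuth)
    delays = far_field_delays(mic_xy, azimuths)                      # (A, M)
    # PHAT weighting: keep only the phase of each channel.
    phat = frame_spectra / np.maximum(np.abs(frame_spectra), 1e-12)  # (M, F)
    # Undo the candidate delays, then sum coherently over microphones.
    steer = np.exp(2j * np.pi * delays[:, :, None] * freqs[None, None, :])  # (A, M, F)
    beam = np.sum(steer * phat[None, :, :], axis=1)                  # (A, F)
    return np.abs(beam) ** 2 / mic_xy.shape[0] ** 2


# Same source, two different array configurations.
rng = np.random.default_rng(0)
fs, n_fft = 16000, 512
source = rng.standard_normal(n_fft // 2 + 1) + 1j * rng.standard_normal(n_fft // 2 + 1)
for n_mics in (4, 6):
    mics = circular_array(n_mics, radius=0.05)
    spectra = simulate_plane_wave(source, mics, 60.0, fs, n_fft)
    afm = azimuth_frequency_map(spectra, mics, fs, n_fft)
    print(n_mics, afm.shape)
```

In this sketch both the 4- and 6-microphone arrays yield a map of the same shape (72 azimuths by 257 frequency bins), so a CNN with a fixed input size could in principle consume either without retraining.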

Publication types

Letter
