Fast Sound Source Localization Using Two-Level Search Space Clustering

IEEE Trans Cybern. 2016 Jan;46(1):20-6. doi: 10.1109/TCYB.2015.2391252. Epub 2015 Feb 11.

Abstract

Steered response power phase transform (SRP-PHAT) is a method that is widely used for robust sound source localization (SSL). However, since SRP-PHAT searches over a large number of candidate locations, it is too slow to run in real-time for large-scale microphone array systems. In this paper, we propose a robust two-level search space clustering method to speed-up SRP-PHAT-based SSL. The proposed method divides the candidate locations of the sound source into a set of groups and finds a small number of groups that are likely to contain the maximum power location. By searching within the small number of groups, the computational costs are reduced by 61.8% compared to a previously proposed method without loss of accuracy.

Publication types

  • Research Support, Non-U.S. Gov't