Multiple Sound Source Localization with SVD-PHAT

Fran├žois Grondin, James Glass

This paper introduces a modification of phase transform on singular value decomposition (SVD-PHAT) to localize multiple sound sources. This work aims to improve localization accuracy and keeps the algorithm complexity low for real-time applications. This method relies on multiple scans of the search space, with projection of each low-dimensional observation onto orthogonal subspaces. We show that this method localizes multiple sound sources more accurately than discrete SRP-PHAT, with a reduction in the Root Mean Square Error up to 0.0395 radians.

 DOI: 10.21437/Interspeech.2019-2653

Cite as: Grondin, F., Glass, J. (2019) Multiple Sound Source Localization with SVD-PHAT. Proc. Interspeech 2019, 2698-2702, DOI: 10.21437/Interspeech.2019-2653.

  author={Fran├žois Grondin and James Glass},
  title={{Multiple Sound Source Localization with SVD-PHAT}},
  booktitle={Proc. Interspeech 2019},