INTERSPEECH 2013
14th Annual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Interference Robust DOA Estimation of Human Speech by Exploiting Historical Information and Temporal Correlation

Wei Xue, Shan Liang, Wenju Liu

Chinese Academy of Sciences, China

Although various direction-of-arrival (DOA) estimation methods for human speech have been presented, most of them assume that the noise received by different microphones is undirected. However, strong directional interferences often exist in practical scenarios, and the performance of existing methods degrades seriously in such cases. In this paper, we present a new interference-robust DOA estimation method for human speech that exploits historical information and temporal correlation. First, utilizing the historical DOA estimates, we perform "post-beamforming" in the last frame to suppress the directional interferences. Then, exploiting the temporal correlation of speech spectra, we calculate frequency weights that emphasize speech-dominated frequency bins, based on the estimated a priori SNR of the enhanced signal. Finally, we propose a new DOA cost function that uses a frequency-weighted spatial correlation matrix to estimate the DOA of the speech source. Experimental results show that the proposed method outperforms existing algorithms in reverberant environments with additive white Gaussian noise in the presence of different kinds of interferences.
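The final step, a DOA search over a frequency-weighted spatial correlation matrix, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the weight formula or the array geometry, so the Wiener-style weight xi / (1 + xi), the uniform linear array, the far-field model, the rank-one per-bin correlation matrix, and all function names below are assumptions made for illustration. The post-beamforming and a priori SNR estimation steps are not modeled; the per-bin SNR values are simply supplied by hand.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s (assumed)


def steering_vector(freq_hz, angle_deg, num_mics, spacing_m):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    delay_step = spacing_m * math.cos(math.radians(angle_deg)) / SPEED_OF_SOUND
    return [cmath.exp(-2j * math.pi * freq_hz * m * delay_step)
            for m in range(num_mics)]


def snr_weights(prior_snr):
    """Map per-bin a priori SNR to weights in [0, 1) (Wiener-style gain, assumed)."""
    return [xi / (1.0 + xi) for xi in prior_snr]


def weighted_doa(spectra, freqs_hz, weights, spacing_m, grid_deg=range(0, 181)):
    """Grid search maximizing sum_k w_k * a_k(theta)^H R_k a_k(theta)."""
    num_mics = len(spectra[0])
    best_angle, best_cost = None, -1.0
    for angle in grid_deg:
        cost = 0.0
        for x_k, f_k, w_k in zip(spectra, freqs_hz, weights):
            a_k = steering_vector(f_k, angle, num_mics, spacing_m)
            # With a rank-one R_k = x_k x_k^H, a^H R a reduces to |a^H x|^2.
            cost += w_k * abs(sum(a.conjugate() * x for a, x in zip(a_k, x_k))) ** 2
        if cost > best_cost:
            best_angle, best_cost = angle, cost
    return best_angle


# Toy scene: speech at 60 degrees in the three lower bins, a stronger
# directional interferer at 120 degrees in the three upper bins.
freqs = [500.0, 1000.0, 1500.0, 2000.0, 2500.0, 3000.0]
spectra = [steering_vector(f, 60, 4, 0.05) for f in freqs[:3]]
spectra += [[2 * v for v in steering_vector(f, 120, 4, 0.05)] for f in freqs[3:]]

prior_snr = [100.0] * 3 + [0.001] * 3  # hand-picked, not estimated
est_weighted = weighted_doa(spectra, freqs, snr_weights(prior_snr), 0.05)
est_plain = weighted_doa(spectra, freqs, [1.0] * len(freqs), 0.05)
print(est_weighted, est_plain)
```

In this toy scene the weighted search settles on the speech direction, while the unweighted search is pulled toward the stronger interferer, which is the failure mode the abstract attributes to methods that assume undirected noise.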


Bibliographic reference. Xue, Wei / Liang, Shan / Liu, Wenju (2013): "Interference robust DOA estimation of human speech by exploiting historical information and temporal correlation", in INTERSPEECH-2013, 2895-2899.