12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Generalized Method for Solving the Permutation Problem in Frequency-Domain Blind Source Separation of Convolved Speech Signals

Auxiliadora Sarmiento, Iván Durán, Sergio Cruces, Pablo Aguilera

Universidad de Sevilla, Spain

The blind speech separation of convolutive mixtures can be performed in the time-frequency domain. The separation problem becomes to a set of instantaneous mixing problems, one for each frequency bin, that can be solved independently by any appropriate instantaneous ICA algorithm. However, the arbitrary order of the estimated sources in each frequency, known as permutation problem, has to be solved to successfully recover the original sources. This paper deals with the permutation problem in the general case of N sources and N observations. The proposed method combines a correlation approach based on the amplitude correlation property of speech signals, and an optimal pairing scheme to align the permuted solutions. Our method is robust to artificially permuted speech signals. Experimental results on simulated convolutive mixtures show the effectiveness of the proposed method in terms of quality of separated signals by objective and perceptual measures.

Full Paper

Bibliographic reference.  Sarmiento, Auxiliadora / Durán, Iván / Cruces, Sergio / Aguilera, Pablo (2011): "Generalized method for solving the permutation problem in frequency-domain blind source separation of convolved speech signals", In INTERSPEECH-2011, 565-568.