14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Discriminative Nonnegative Dictionary Learning Using Cross-Coherence Penalties for Single Channel Source Separation

Emad M. Grais, Hakan Erdogan

Sabancı Üniversitesi, Turkey

In this work, we introduce a new discriminative training method for nonnegative dictionary learning. The new method can be used in single channel source separation (SCSS) applications. In SCSS, nonnegative matrix factorization (NMF) is used to learn a dictionary (a set of basis vectors) for each source in the magnitude spectrum domain. The trained dictionaries are then used in decomposing the mixed signal to find the estimate for each source. Learning discriminative dictionaries for the source signals can improve the separation performance. To achieve discriminative dictionaries, we try to avoid the bases set of one source dictionary from representing the other source signals. We propose to minimize cross-coherence between the dictionaries of all sources in the mixed signal. We incorporate a simplified cross-coherence penalty using a regularized NMF cost function to simultaneously learn discriminative and reconstructive dictionaries. The new regularized NMF update rules that are used to discriminatively train the dictionaries are introduced in this work. Experimental results show that using discriminative training gives better separation results than using conventional NMF.

Full Paper

Bibliographic reference.  Grais, Emad M. / Erdogan, Hakan (2013): "Discriminative nonnegative dictionary learning using cross-coherence penalties for single channel source separation", In INTERSPEECH-2013, 808-812.