13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Speaker Independent Single Channel Source Separation using Sinusoidal Features

Shivesh Ranjan (1), Karen L. Payton (1), Pejman Mowlaee (2)

(1) Electrical & Computer Engineering Dept., University of Massachusetts Dartmouth, North Dartmouth, MA, USA
(2) Institute of Communication Acoustics (IKA), Ruhr-Universität Bochum, Germany

Model-based approaches to achieve Single Channel Source Separation (SCSS) have been reasonably successful at separating two sources. However, most of the currently used model-based approaches require pre-trained speaker specific models in order to perform the separation. Often, insufficient or no prior training data may be available to develop such speaker specific models, necessitating the use of a speaker independent approach to SCSS. This paper proposes a speaker independent approach to SCSS using sinusoidal features. The algorithm develops speaker models for novel speakers from the speech mixtures under test, using prior training data available from other speakers. An iterative scheme improves the models with respect to the novel speakers present in the test mixtures. Experimental results indicate improved separation performance as measured by the Perceptual Evaluation of Speech Quality (PESQ) scores of the separated sources.

Index Terms: single channel, source separation, speaker independent, sinusoidal features

Full Paper

Bibliographic reference.  Ranjan, Shivesh / Payton, Karen L. / Mowlaee, Pejman (2012): "Speaker independent single channel source separation using sinusoidal features", In INTERSPEECH-2012, 1524-1527.