The Method of Random Directions Optimization for Stereo Audio Source Separation

Oleg Golokolenko, Gerald Schuller


This paper presents and experimentally evaluates a novel, fast time domain audio source separation technique based on fractional delay filters, with low computational complexity and small algorithmic delay. Our goal is a Blind Source Separation (BSS) technique that is applicable to low-cost, low-power devices where processing is done in real time, e.g. hearing aids or teleconferencing setups. The proposed approach optimizes fractional delays, implemented as IIR filters, and attenuation factors between microphone signals to minimize crosstalk, following the principle of a fractional delay and sum beamformer. The experiments were carried out for offline separation with stationary sound sources and for real-time separation with randomly moving sound sources. Experimental results show that the separation performance of the proposed time domain BSS technique is competitive with State-of-the-Art (SoA) approaches, while it has lower computational complexity and none of the system delay found in frequency domain BSS.
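
The idea summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the fractional delay is approximated by linear interpolation rather than by the IIR filters the abstract mentions, the crosstalk measure is a stand-in (absolute correlation between the two output channels), and the update rule is a generic "perturb randomly, keep the candidate only if the cost drops" search. All function names (`frac_delay`, `unmix`, `crosstalk_cost`, `random_directions`) and all parameter values are hypothetical.

```python
import numpy as np

def frac_delay(x, d):
    """Delay x by d >= 0 samples (d may be fractional).
    Assumption: linear interpolation stands in for an IIR fractional delay filter."""
    n = np.arange(len(x))
    return np.interp(n - d, n, x, left=0.0)

def unmix(x0, x1, coeffs):
    """Subtract a delayed, attenuated copy of the other microphone signal
    from each channel. coeffs = (a0, a1, d0, d1)."""
    a0, a1, d0, d1 = coeffs
    y0 = x0 - a0 * frac_delay(x1, d0)
    y1 = x1 - a1 * frac_delay(x0, d1)
    return y0, y1

def crosstalk_cost(x0, x1, coeffs):
    """Assumed crosstalk measure: absolute correlation of the output channels."""
    y0, y1 = unmix(x0, x1, coeffs)
    return abs(np.corrcoef(y0, y1)[0, 1])

def random_directions(x0, x1, init=(0.5, 0.5, 1.0, 1.0),
                      iters=300, step=0.5, max_delay=20.0, seed=0):
    """Random search: try a random perturbation of the coefficients and
    keep it only when the cost decreases."""
    rng = np.random.default_rng(seed)
    coeffs = np.array(init, dtype=float)
    best = crosstalk_cost(x0, x1, coeffs)
    for _ in range(iters):
        cand = coeffs + step * rng.standard_normal(4)
        cand[:2] = np.clip(cand[:2], 0.0, 1.0)        # attenuation factors
        cand[2:] = np.clip(cand[2:], 0.0, max_delay)  # delays in samples
        cost = crosstalk_cost(x0, x1, cand)
        if cost < best:
            coeffs, best = cand, cost
    return coeffs, best
```

Calling `random_directions(x0, x1)` on two microphone signals of equal length returns the four coefficients and the final cost; `unmix(x0, x1, coeffs)` then yields the two output channels. Because a candidate is accepted only when it lowers the cost, the search needs no gradients, which is one reason such a scheme can stay computationally cheap.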


DOI: 10.21437/Interspeech.2020-1409

Cite as: Golokolenko, O., Schuller, G. (2020) The Method of Random Directions Optimization for Stereo Audio Source Separation. Proc. Interspeech 2020, 3316-3320, DOI: 10.21437/Interspeech.2020-1409.


@inproceedings{Golokolenko2020,
  author={Oleg Golokolenko and Gerald Schuller},
  title={{The Method of Random Directions Optimization for Stereo Audio Source Separation}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={3316--3320},
  doi={10.21437/Interspeech.2020-1409},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1409}
}