13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

A signal-separation-based Array Postfilter for Distant Speech Recognition

Rita Singh (1), Kenichi Kumatani (2), John McDonough (1), Chen Liu (3)

(1) Carnegie Mellon University, Pittsburgh, PA, USA
(2) Disney Research, Pittsburgh, PA, USA
(3) Spansion Inc., Sunnyvale, CA, USA

In standard microphone array processing for distant speech recognition, the beamformed output is postfiltered to reduce residual noise. Postfiltering is usually performed through a weiner filter whose parameters are estimated from both the beamformer output and the signals captured at the microphones themselves. Conventional postfiltering methods assume diffuse or incoherent noise at the various microphones in order to estimate these parameters. When the noise does not conform to this assumption they perform poorly. We propose an alternate postfiltering mechanism that attenuates noise by estimating and separating out the contributions of speech and noise explicitly. Experiments on a corpus of in-car two-channel recordings show that the proposed postfiltering algorithm outperforms conventional postfilters significantly under many noise conditions.

Index Terms: Microphone arrays, postfiltering, beamforming, com- positional models, signal separation

Full Paper

Bibliographic reference.  Singh, Rita / Kumatani, Kenichi / McDonough, John / Liu, Chen (2012): "A signal-separation-based array postfilter for distant speech recognition", In INTERSPEECH-2012, 1934-1937.