13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Combination of Sparse Classification and Multilayer Perceptron for Noise-robust ASR

Yang Sun (1,3), Mathew M. Doss (3), Jort F. Gemmeke (2), Bert Cranen (1), Louis ten Bosch (1), Lou Boves (1)

1Centre for Language and (S) peech Technology, Radboud University Nijmegen, the Netherlands
(2) Department ESAT, KU Leuven, Belgium
(3) Idiap Research Institute, Martigny, Switzerland

On the AURORA-2 task good results at low SNR levels have been obtained with a system that uses state posterior estimates provided by an exemplar-based sparse classification (SC) system. At the same time, posterior estimates obtained with multilayer perceptron (MLP) yield good results at high SNRs. In this paper, we investigate the effect of combining the estimates from the SC and MLP systems at the probability level. More precisely, the probabilities are combined by a sum rule or a product rule using static and inverse-entropy based dynamic weights. In addition, we investigate a modified dynamic weighting approach which enhances the contribution of SC stream based on the information about static weights and average dynamic weights obtained on cross validation data. Our studies on AURORA-2 task shows that in all conditions the modified dynamic weighting approach yields a dual-input system that performs better than or equal to the best stand-alone system.

Full Paper

Bibliographic reference.  Sun, Yang / Doss, Mathew M. / Gemmeke, Jort F. / Cranen, Bert / Bosch, Louis ten / Boves, Lou (2012): "Combination of sparse classification and multilayer perceptron for noise-robust ASR", In INTERSPEECH-2012, 310-313.