13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Uncertainty-driven Compensation of Multi-Stream MLP Acoustic Models for Robust ASR

Ramón Fernandez Astudillo, Alberto Abad, João Paulo da Silva Neto

Spoken Language Laboratory, INESC-ID-Lisboa, Lisboa, Portugal

In this paper we show how the robustness of multi-stream multi-layer perceptron (MLP) acoustic models can be increased through uncertainty propagation and decoding. We demonstrate that MLP uncertainty decoding yields consistent improvements over minimum mean square error (MMSE) feature enhancement in the MFCC and RASTA-LPCC domains. We also introduce formulas for computing the uncertainty associated with the acoustic likelihood, and explore different stream integration schemes that use this uncertainty on the AURORA4 corpus.

Index Terms: uncertainty propagation, observation uncertainty, MLP, multi-stream


Bibliographic reference. Fernandez Astudillo, Ramón / Abad, Alberto / Neto, João Paulo da Silva (2012): "Uncertainty-driven compensation of multi-stream MLP acoustic models for robust ASR", In INTERSPEECH-2012, 2606-2609.