EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Robust Speech Recognition Techniques Applied to a Speech in Noise Task

Richard C. Rose (1), Hong Kook Kim (1), Don Hindle (2)

(1) AT&T Labs - Research, USA; (2) AnswerLogic Inc., USA

This paper describes the design and evaluation of an automatic speech recognition (ASR) system on the Naval Research Laboratory Speech In Noise (SPINE) speech corpus. This corpus represents a task which involves human-human interaction on a constrained problem solving scenario under six different simulated noisy environments. Acoustic and language modeling were performed using a dataset taken entirely from a subset of the acoustic environments. Speech recognition was performed on continuous conversations by detecting speech utterances, performing acoustic feature analysis and normalization, and adapting HMM models in multiple passes over each conversation-side. The ASR word accuracy (WAC) ranged from 77 percent in an office environment to 61 percent in conditions that include significant levels of background speech and noise. An overall average WAC of 69.0 percent was obtained across all noise conditions.

Full Paper

Bibliographic reference.  Rose, Richard C. / Kim, Hong Kook / Hindle, Don (2001): "Robust speech recognition techniques applied to a speech in noise task", In EUROSPEECH-2001, 2351-2355.