Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991


A Comparative Study of Parameters and Distances for Noisy Speech Recognition

Javier Hernando, Climent Nadeu

Dept. of Signal Theory and Communications, Universitat Politecnica de Catalunya, Barcelona, Catalonia, Spain

Speech recognition in noisy environments remains an unsolved problem even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. Concretely, the Short-Time Modified Coherence (SMC) parameterization and the Cepstral Projection Distortion (CPD) measure have shown excellent results when tested in a speech recognition system based on Dynamic Time Warping (DTW) and using speech contaminated by additive white noise. In this paper, a new technique based on the AR modeling of the one-sided autocorrelation sequence (OS ALPC) is presented and, from a comparative study of these LPC-based techniques in the Hidden Markov Model (HMM) approach, two main conclusions are attained: 1) the slope cepstral window and a relatively high model order are preferable, and 2) the cepstral representation based on the autocorrelation (rather on the signal) modeling achieves excellent results.

Full Paper

Bibliographic reference.  Hernando, Javier / Nadeu, Climent (1991): "A comparative study of parameters and distances for noisy speech recognition", In EUROSPEECH-1991, 91-94.