EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Robust Automatic Speech Recognition in Low-SNR Car Environments by the Application of a Connectionist Subspace-Based Approach to the Melbased Cepstral Coefficients

Sid-Ahmed Selouani, Hesham Tolba, Douglas O’Shaughnessy

INRS-Telecommunications, Canada

In this paper, the problem of robust continuous-speech recognition (CSR) in the presence of highly interfering car noise has been considered. Our approach is based on the noise reduction of the parameters that we use for recognition, that is, the Mel-based cepstral coefficients. This is achieved by the use of a Multilayer Perceptron (MLP) network for noise reduction in the cepstral domain in order to get less-variant parameters. Then, the obtained enhanced features are {it refined} via the Karhunen-Loève Transform (KLT) implemented using the Principal Component Analysis (PCA). Experiments show that the use of the enhanced parameters using such an approach increases the recognition rate of the CSR process in highly interfering car noise environments. Results show that the proposed hybrid technique when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process based on either a KLT- or an MLP-based preprocessing recognition in severe interfering car noise environments for a wide range of SNRs varying from 16 dB to -4 dB using a noisy version of the TIMIT database.

Full Paper

Bibliographic reference.  Selouani, Sid-Ahmed / Tolba, Hesham / O’Shaughnessy, Douglas (2001): "Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients", In EUROSPEECH-2001, 1577-1580.