13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

A Correlational Discriminant Approach to Feature Extraction for Robust Speech Recognition

Vikrant Singh Tomar, Richard C. Rose

Department of Electrical and Computer Engineering, McGill University, Montreal, QC, Canada

A non-linear discriminant analysis based approach to feature space dimensionality reduction in noise robust automatic speech recognition (ASR) is proposed. It utilizes a correlation based distance measure instead of the conventional Euclidean distance. The use of this "correlation preserving discriminant analysis" (CPDA) procedure is motivated by evidence suggesting that correlation based cepstrum distance measures can be more robust than Euclidean based distances when speech is corrupted by noise. The performance of CPDA is evaluated in terms of the word error rate obtained by using CPDA derived features on the Aurora 2 speech in noise corpus, and is compared to the commonly used linear discriminant analysis (LDA) approach to feature space transformations.

Index Terms: Correlation preserving discriminant analysis, graph embedding, dimensionality reduction, speech recognition

Full Paper

Bibliographic reference.  Tomar, Vikrant Singh / Rose, Richard C. (2012): "A correlational discriminant approach to feature extraction for robust speech recognition", In INTERSPEECH-2012, 555-558.