EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Evaluation of Front-End Features and Noise Compensation Methods for Robust Mandarin Speech Recognition

Rathi Chengalvarayan

Lucent Technologies, USA

This paper describes speaker-independent speech recognition experiments concerning acoustic front-end processing on a telephone database that was recorded in various dialect regions in China. In this paper, three different features based on human voice production, perception and auditory systems have been evaluated for Mandarin speech recognition. Experimental comparisons showed that auditory-filtered cepstral coefficients outperforms the other type of features. When speech recognizers are deployed in telephone services, they often encounter variable acoustic mismatches which significantly deteriorate their performance. Three different channel equalization techniques have been explored in this study to decrease this mismatch, hence improving the recognition accuracy. We present results with various noise compensation methods based on hierarchical cepstral mean subtaction and signal bias removal.

Full Paper

Bibliographic reference.  Chengalvarayan, Rathi (2001): "Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition", In EUROSPEECH-2001, 897-900.