4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Robust Speech Recognition Features Based on Temporal Trajectory Filtering of Frequency Band Spectrum

Jia-lin Shen (1), Wen-liang Hwang (2), Lin-shan Lee (1,2)

(1) Dept. of Electrical Engineering, National Taiwan University
(2) Institute of Information Science, Academia Sinica Taipei, Taiwan

This paper presents the use of a variety of filters in the temporal trajectories of frequency band spectrum to extract speech recognition features for environmental robustness. Three kind of filters for emphasizing the statistically important parts of speech are proposed. First, a bank of RASTA-like band-pass filters to fit the statistical peaks of modulation frequency band spectrum of speech are used. Secondly, a three-channel octave band-filter band with a smoothed rectangular window spline is applied. Thirdly, a data- driven filter is developed. Experimental results show that significant improvements for speech recognition using the proposed feature extraction approach under noisy environments can be achieved.

Full Paper

Bibliographic reference.  Shen, Jia-lin / Hwang, Wen-liang / Lee, Lin-shan (1996): "Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum", In ICSLP-1996, 881-884.