5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

On the Importance of Various Modulation Frequencies for Speech Recognition

Noboru Kanedera (1), Takayuki Arai (2), Hynek Hermansky (2), Misha Pavel (3)

(1) Oregon Graduate Institute of Science & Technology, Portland, Oregon, USA
(2) International Computer Science Institute, Berkeley, California, USA
(3) Ishikawa National College of Technology, Japan

Temporal processing of time trajectories in the logarithmic spectral domain, as performed in cepstral mean subtraction, in the computation of dynamic features, or in RASTA processing, is becoming a common procedure in current automatic speech recognition (ASR). Such temporal processing effectively enhances some components of the modulation spectrum of speech while suppressing others. It is therefore important to know the relative importance of the various components of the modulation spectrum of speech. In this study we report on the effect of band-pass filtering the time trajectories of spectral envelopes on speech recognition. The results indicate the relative importance of different components of the modulation spectrum of speech for ASR.
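
As a rough illustration of what band-pass filtering of time trajectories means, the following minimal sketch filters each band of a log-spectral array along the time axis by masking its modulation-frequency components. Everything specific here is an assumption made for the example and not taken from the paper: the function name `bandpass_trajectories`, the 100 Hz frame rate, the 1 to 16 Hz pass band, and the use of a simple FFT mask in place of whatever filter design the authors actually used.

```python
import numpy as np


def bandpass_trajectories(log_spec, frame_rate_hz, low_hz, high_hz):
    """Band-pass filter each band's time trajectory (hypothetical helper).

    log_spec has shape (n_frames, n_bands): one log-spectral value per
    frame and frequency band. Filtering is done along the frame axis by
    zeroing modulation-frequency bins outside [low_hz, high_hz].
    """
    log_spec = np.asarray(log_spec, dtype=float)
    n_frames = log_spec.shape[0]

    # Modulation spectrum: Fourier transform of each trajectory over time.
    mod_spec = np.fft.rfft(log_spec, axis=0)
    mod_freqs = np.fft.rfftfreq(n_frames, d=1.0 / frame_rate_hz)

    # Keep only the modulation frequencies inside the pass band.
    keep = (mod_freqs >= low_hz) & (mod_freqs <= high_hz)
    mod_spec[~keep, :] = 0.0

    return np.fft.irfft(mod_spec, n=n_frames, axis=0)


# Synthetic single-band trajectory: a constant offset (0 Hz), a slow
# 4 Hz modulation, and a fast 30 Hz modulation. All values are made up.
frame_rate_hz = 100.0  # assumed frame rate (10 ms frame shift)
t = np.arange(200) / frame_rate_hz
slow = np.sin(2 * np.pi * 4.0 * t)
fast = 0.5 * np.sin(2 * np.pi * 30.0 * t)
log_spec = (3.0 + slow + fast)[:, np.newaxis]

# Illustrative pass band; the offset and the 30 Hz component are removed.
filtered = bandpass_trajectories(log_spec, frame_rate_hz, 1.0, 16.0)
```

In this toy case the constant offset (the component that cepstral mean subtraction also removes) and the fast modulation are suppressed, while the 4 Hz component passes through unchanged.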

Bibliographic reference. Kanedera, Noboru / Arai, Takayuki / Hermansky, Hynek / Pavel, Misha (1997): "On the importance of various modulation frequencies for speech recognition", in EUROSPEECH-1997, 1079-1082.