Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Physiologically-Motivated Synchrony-Based Processing for Robust Automatic Speech Recognition

Chanwoo Kim, Yu-Hsiang Chiu, Richard M. Stern

Carnegie Mellon University, USA

This paper describes the structure and performance of a new signal processing scheme, motivated by the physiology of the peripheral auditory system, that improves speech recognition accuracy in the presence of broadband noise. An important attribute of the peripheral processing is a novel mechanism to represent the cycle-by-cycle synchrony in the response of low-frequency auditory-nerve fibers, in addition to the more conventional processing based on mean rate of response. It is shown that the use of the physiologically-motivated peripheral processing improves recognition accuracy in the presence of both broadband and transient noise, and that the use of the synchrony mechanism provides further improvement beyond that which is provided by the mean rate mechanism.

Full Paper

Bibliographic reference.  Kim, Chanwoo / Chiu, Yu-Hsiang / Stern, Richard M. (2006): "Physiologically-motivated synchrony-based processing for robust automatic speech recognition", In INTERSPEECH-2006, paper 1975-Wed1A2O.2.