EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

Efficient Scalable Speech Compression for Scalable Speech Recognition

Naveen Srinivasamurthy, Antonio Ortega, Shrikanth Narayanan

University of Southern California, USA

We propose a scalable recognition system that reduces recognition complexity. In a distributed speech recognition (DSR) application, scalable recognition can be combined with scalable compression to reduce both the computational load and the bandwidth requirement at the server. A low-complexity pre-processor eliminates the unlikely classes, so that the complex recognizer only has to consider the reduced subset of classes when recognizing the unknown utterance. We show that with this system it is fairly straightforward to trade off reductions in complexity against performance degradation. Preliminary experiments on the TI-46 word digit database show that the proposed scalable approach can provide a 40% speed-up while operating under 1.05 kbps, compared to baseline recognition using uncompressed speech.
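
The two-stage idea described above (a cheap pre-processor prunes unlikely classes, then a costlier recognizer decides among the survivors) can be illustrated with a minimal sketch. This is not the authors' system: the nearest-template scoring, the function names, and the parameters `dims` (how many feature dimensions the cheap stage looks at) and `keep` (how many classes survive pruning) are all assumptions made for illustration.

```python
import numpy as np


def coarse_scores(x, templates, dims):
    """Cheap stage (assumed): squared distance over the first `dims` features only."""
    diff = templates[:, :dims] - x[:dims]
    return np.sum(diff * diff, axis=1)


def full_scores(x, templates, candidates):
    """Costly stage (assumed): squared distance over all features,
    evaluated only for the candidate classes."""
    diff = templates[candidates] - x
    return np.sum(diff * diff, axis=1)


def two_stage_recognize(x, templates, keep, dims):
    """Prune to the `keep` best classes with the cheap score,
    then pick the best of those with the full score."""
    coarse = coarse_scores(x, templates, dims)
    candidates = np.argsort(coarse)[:keep]  # indices of surviving classes
    fine = full_scores(x, templates, candidates)
    best = int(candidates[np.argmin(fine)])
    return best, candidates
```

In this sketch, a smaller `keep` means fewer full-score evaluations (lower complexity) but a higher chance that the correct class is pruned before the second stage, which mirrors the complexity versus performance trade-off the abstract mentions.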

Bibliographic reference.  Srinivasamurthy, Naveen / Ortega, Antonio / Narayanan, Shrikanth (2001): "Efficient scalable speech compression for scalable speech recognition", In EUROSPEECH-2001, 1845-1848.