EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Robust Speech Recognition against Packet Loss

Manhung Siu, Yu-Chung Chan

Hong Kong University of Science and Technology, Hong Kong

Recognizing speech transmitted over mobile or computer networks poses new challenges, such as packet loss in transmission. The Viterbi algorithm, the most common speech recognition approach, searches for the most likely state sequence that explains all observations. However, because it implicitly sums the log observation probabilities, the resulting solution is sensitive to outlier frames. In this paper, we propose a robust approach that searches for the state sequence that best explains x percent of the observations and is insensitive to the corruption of a limited number of frames. We evaluated the proposed algorithm on the TI-digits task. With 10% of the data lost, the proposed algorithm achieves an improvement of 71.6% for isolated digit recognition and 32.2% for connected digit recognition.
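The abstract does not spell out the search procedure, so the following is only a minimal sketch of one way the idea could be realized, not the authors' algorithm. It assumes a Viterbi-style dynamic program that may ignore up to `max_skips` frames, where an ignored frame contributes a log score of 0 but the state transition is still taken. The function name `viterbi_with_skips`, its parameters, and the zero-score convention are all hypothetical choices made for illustration. Setting `max_skips` to roughly (1 - x/100) times the number of frames corresponds to explaining x percent of the observations.

```python
import math


def viterbi_with_skips(log_init, log_trans, log_obs, max_skips):
    """Best path log score when up to max_skips frames may be ignored.

    log_init[s]     : log initial probability of state s
    log_trans[r][s] : log transition probability from state r to state s
    log_obs[t][s]   : log observation probability of frame t in state s

    Assumption of this sketch: an ignored frame adds 0 to the score,
    while the transition into the state for that frame is still applied.
    """
    n_states = len(log_init)
    neg_inf = -math.inf

    # score[k][s]: best log score ending in state s with exactly k frames ignored
    score = [[neg_inf] * n_states for _ in range(max_skips + 1)]
    for s in range(n_states):
        score[0][s] = log_init[s] + log_obs[0][s]  # keep the first frame
        if max_skips >= 1:
            score[1][s] = log_init[s]              # ignore the first frame

    for t in range(1, len(log_obs)):
        new = [[neg_inf] * n_states for _ in range(max_skips + 1)]
        for k in range(max_skips + 1):
            for s in range(n_states):
                # Option 1: keep frame t (skip count unchanged).
                keep = max(score[k][r] + log_trans[r][s] for r in range(n_states))
                best = keep + log_obs[t][s]
                # Option 2: ignore frame t (uses one unit of the skip budget).
                if k >= 1:
                    skip = max(score[k - 1][r] + log_trans[r][s] for r in range(n_states))
                    best = max(best, skip)
                new[k][s] = best
        score = new

    # Best score over all end states and all skip counts within the budget.
    return max(max(row) for row in score)


# Toy example: one state, three frames, the middle frame is an outlier.
log_init = [0.0]
log_trans = [[0.0]]
log_obs = [[-1.0], [-50.0], [-1.0]]

standard = viterbi_with_skips(log_init, log_trans, log_obs, max_skips=0)
robust = viterbi_with_skips(log_init, log_trans, log_obs, max_skips=1)
```

With `max_skips=0` the sketch reduces to the standard Viterbi score, so the single outlier frame dominates the sum of log probabilities. Allowing one ignored frame removes that contribution, which illustrates why a search that only needs to explain part of the observations is less sensitive to a limited number of corrupted frames.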


Bibliographic reference. Siu, Manhung / Chan, Yu-Chung (2001): "Robust speech recognition against packet loss", in EUROSPEECH-2001, 1095-1098.