12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Candidate Generation for ASR Output Error Correction Using a Context-Dependent Syllable Cluster-Based Confusion Matrix

Chao-Hong Liu, Chung-Hsien Wu, David Sarwono, Jhing-Fa Wang

National Cheng Kung University, Taiwan

Error correction techniques have been proposed in the applications of language learning and spoken dialogue systems for spoken language understanding. These techniques include two consecutive stages: the generation of correction candidates and the selection of correction candidates. In this study, a Context-Dependent Syllable Cluster (CD-SC)-based Confusion Matrix is proposed for the generation of correction candidates. A Contextual Fitness Score, measuring the sequential relationship to the neighbors of the candidate, is proposed for corrected syllable sequence selection. Finally, the n-gram language model is used to determine the final word sequence output. Experiments show that the proposed method improved from 0.742 to 0.771 in terms of BLEU score as compared to the conventional speech recognition mechanism.

Full Paper

Bibliographic reference.  Liu, Chao-Hong / Wu, Chung-Hsien / Sarwono, David / Wang, Jhing-Fa (2011): "Candidate generation for ASR output error correction using a context-dependent syllable cluster-based confusion matrix", In INTERSPEECH-2011, 1633-1636.