12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

New Methods for Template Selection and Compression in Continuous Speech Recognition

Xie Sun, Yunxin Zhao

University of Missouri, USA

We propose a maximum likelihood method for selecting template representatives, and in order to include more information in the selected template representatives, we further propose to create compressed template representatives by Gaussian mixture model (GMM) merging algorithm. A Kullback-Leibler (KL) divergence based local distance is proposed for Dynamic Time Warping (DTW) in template matching. Experimental results on the tasks of TIMIT phone recognition and large vocabulary continuous speech recognition demonstrated that the proposed template selection method significantly improved the recognition accuracy over the HMM baseline while only 5% or 10% templates were selected from the total templates, and the template compression method has provided further recognition accuracy gains over the template selection method.

Full Paper

Bibliographic reference.  Sun, Xie / Zhao, Yunxin (2011): "New methods for template selection and compression in continuous speech recognition", In INTERSPEECH-2011, 985-988.