4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Improvement in N-Best Search for Continuous Speech Recognition

Irina Illina, Yifan Gong

CRIN/CNRS, INRIA-Lorraine, Vandoeuvre-les-Nancy, France

In this paper, several techniques for reducing the search complexity of beam search for continuous speech recognition task are proposed. Six heuristic methods for pruning are described and the parameters of the pruning are adjusted to keep constant the word error rate while reducing the computational complexity and memory demand. The evaluation of the effect of each pruning method is performed in Mixture Stochastic Trajectory Model (MSTM). MSTM is a segment-based model using phonemes as the speech units. The set of tests in a speaker-dependent continuous speech recognition task shows that using the pruning methods, a substantial reduction of 67% of search effort is obtained in term of number of hypothesised phonemes during the search. All proposed techniques are independent of the acoustic models and therefore are applicable to other acoustic modeling techniques.

Full Paper

Bibliographic reference.  Illina, Irina / Gong, Yifan (1996): "Improvement in n-best search for continuous speech recognition", In ICSLP-1996, 2147-2150.