EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


On the Choice of Classes in MCE based discriminative HMM-Training for Speech Recognizers used in the Telephone Environment

Josef G. Bauer

Siemens, Germany

One of the most commonly used discriminative approaches in parameter estimation for Hidden Markov Models is the Minimum Classification Error (MCE) method . This paper studies possible choices for the classes (i.e. basic speech units) in MCE training and their application for several tasks suitable for speech driven dialog systems in the telephone environment. The considered choices of classes are HMM states, phonemes, words and sequences of words. The theoretical suitability and practical considerations for the different criteria are discussed. Using the different training criteria consistent experimental results are given for four tasks: non-task-specific training, training for small vocabulary isolated word recognition, training for connected digit recognition and for letter recognition. In all experiments not only the objective of the optimization but also the resulting word recognition performance is investigated. It shows that for the given setup only word and word string based criteria are capable to reduce the word error rate.

Full Paper

Bibliographic reference.  Bauer, Josef G. (2001): "On the choice of classes in MCE based discriminative HMM-training for speech recognizers used in the telephone environment", In EUROSPEECH-2001, 1633-1636.