Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991


Entropic Training for HMM Speech Recognition

Antonio M. Peinado, Ramon Roman, Jose C. Segura, Antonio J. Rubio, Pedro Garcia, Jesus E. Diaz

Departamento de Electronica y Tecnologia de Computadores, Universidad de Granada, Granada, Spain

The segmentation of the training data has become the most used initialization method for HMM training. The Viterbi reestimation has been widely applied for that purpose. We introduce a new segmentation method, based on the maximization of the Entropic Cohesion Measure (ECM) between segments and observations, which is equivalent to minimize the entropy model. This maximization is carried out by looking for the optimal boundaries between segments of the training utterances. Thus, we obtain an optimal segmentation (in the ECM sense) that achieves similar performance to the Viterbi reestimation with a considerable computational saving.

Full Paper

Bibliographic reference.  Peinado, Antonio M. / Roman, Ramon / Segura, Jose C. / Rubio, Antonio J. / Garcia, Pedro / Diaz, Jesus E. (1991): "Entropic training for HMM speech recognition", In EUROSPEECH-1991, 651-654.