4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Discriminative Optimisation of Large Vocabulary Recognition Systems

V. Valtchev, P. C. Woodland, S. J. Young

Cambridge University Engineering Department, Cambridge, UK

This paper describes a framework for optimising the structure and parameters of a continuous density HMM-based large vocabulary recognition system using the Maximum Mutual Information Estimation (MMIE) criterion. To reduce the computational complexity of the MMIE training algorithm, confusable segments of speech are identified and stored as word lattices of alternative utterance hypotheses. An iterative mixture splitting procedure is also employed to adjust the number of mixture components in each state during training such that the optimal balance between number of parameters and available training data is achieved. Experiments are presented on various test sets from the Wall Street Journal database using the full SI-284 training set. These show that the use of lattices makes MMIE training practicable for very complex recognition systems and large training sets. Furthermore, experimental results demonstrate that MMIE optimisation of system structure and parameters can yield useful increases in recognition accuracy.

Full Paper

Bibliographic reference.  Valtchev, V. / Woodland, P. C. / Young, S. J. (1996): "Discriminative optimisation of large vocabulary recognition systems", In ICSLP-1996, 18-21.