Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

A Continuous HMM Based Preprocessor for Modular Speech Recognition Neural Networks

Fikret S. Gurgen, J. M. Song, R. W. King

Sydney University, Electrical Eng. Dept, NSW, Australia

This paper uses a speech recognition method which combines a continuous density hidden Markov model (CDHMM)-based preprocessor with window-based neural network (WNN) architectures. The method also employs modularity of NNs. It removes fixed-sized input constraint of NN and improves recognition performance. The CDHMM preprocessor performs a priori Gaussian shaping and normalization using statistically modelled state vectors in contrast to simple distance metric between acoustic vectors in dynamic time warping (DTW) preprocessor. Then, normalized and a priori Gaussian shaped speech features are applied as input to WNN and modular WNN architectures. NIST TI-46 E-set experiments are performed and the results are compared with a baseline CDHMM results. The proposed system improves the recognition performance. Modular WNN provides further significant improvement on the performance.

Full Paper

Bibliographic reference.  Gurgen, Fikret S. / Song, J. M. / King, R. W. (1994): "A continuous HMM based preprocessor for modular speech recognition neural networks", In ICSLP-1994, 1507-1510.