Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Temporal Correlation Modeling in a Hybrid Neural Network/Hidden Markov Model Speech Recognizer

Horatio Franco, Vassilios Digalakis

SRI International, Speech Technology and Research Laboratory, Menlo Park, CA, USA

A new scheme to overcome the independence assumption in standard hidden Markov modeling (HMM) formulations is presented within the framework of a hybrid system that uses a discriminatively trained multilayer perceptron (MLP) to compute a correlated emission probability. The scheme takes advantage of the MLP's ability to model correlations across multiple frames allowing the use of multiframe long vector history to condition the emission probability. The required number of parameters is the same as in the standard hybrid HMM/MLP formulation. Results presented in a large vocabulary continuous speech recognition task show that even though performance so far has not improved over the standard approach, the acoustic and language model probabilities are better balanced with this new scheme as compared to the standard one.

Full Paper

Bibliographic reference.  Franco, Horatio / Digalakis, Vassilios (1995): "Temporal correlation modeling in a hybrid neural network/hidden Markov model speech recognizer", In EUROSPEECH-1995, 1681-1684.