4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
In this paper we present a new technique in the stochastic matching framework to compensate for non-linear distortions in speech recognition. The features of the test data and the means of the trained model are both transformed using neural networks to better fit each other. The parameters of the neural network are estimated using a novel combination of the generalized EM (GEM) and the back-propagation algorithms. In the feature transformation case, when the exact Q-functions cannot be calculated, approximations are heuristically derived. The mathematical properties of the new algorithm are analyzed. The performance of the algorithm is also studied under different mismatch conditions.
Bibliographic reference. Surendran, A. C. / Lee, Chin-Hui / Rahim, Mazin G. (1996): "Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition", In ICSLP-1996, 1836-1839.