Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991


Adapting Input Transformations Using Alpha-Nets for Whole Word Speech Recognition

Michael J. Carey, Eluned S. Parris

Ensigma Ltd. , Chepstow, Gwent, UK

Alpha-nets interpret a set of Hidden Markov Models (HMMs) as a connectionist network enabling back propagation to be used to adapt the parameters of the HMMs, thus providing a powerful technique for improving the discriminatory ability of the HMMs. Previous work has focussed on the adaption of the state parameters of models in which the emission probability is described by a single mode Gaussian distribution. The parameters adapted can be the mixture coefficients themselves in addition to the multivariate means and variances or the elements of the input transformation. The input transformation may be linear or non-linear. We provide experimental results which show that, in a speaker independent isolated digit recognition task using telephone quality speech, adapting the parameters of the input transformation can reduce the error rate observed on the training set, while testing with unseen material produces less improvement in the system performance.

Full Paper

Bibliographic reference.  Carey, Michael J. / Parris, Eluned S. (1991): "Adapting input transformations using alpha-nets for whole word speech recognition", In EUROSPEECH-1991, 555-558.