5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

Initial Speech Recognition Results Using The Multinet Architecture

Ednaldo B. Pizzolato, T. Jeff Reynolds

University of Essex, UK

Multinet is a connectionist architecture designed for certain difficult multi-class pattern classification tasks. These tasks are characterised by very large input feature spaces, which render a monolithic classifier impractical. The architecture consists of a layer with at least one primary 'detector' for each class, followed by a combining net that estimates the posterior probabilities for all classes. Typically, each primary detector takes only a subset of the input features as input. Thus the architecture decomposes classification in two ways: by class and by factoring the dimensions of the input space. Multinet incorporates ideas from Modular Neural Networks and Ensembles. In this paper, we investigate the use of Multinet for speech recognition and compare it with standard HMM and hybrid HMM-NN systems that we run on the same tasks. The value and potential of the Multinet approach are shown by detailing successive improvements to the Multinet system, which are easily obtained because of the modularity of the architecture.
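The two-stage structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `Detector` and `MultinetSketch`, the use of a single logistic unit per detector, the linear-plus-softmax combining net, the feature subsets, and the random untrained weights are all assumptions made for illustration. It shows only the data flow: each detector sees a subset of the input features, and a combining stage maps the detector outputs to a posterior estimate over all classes.

```python
import math
import random


def sigmoid(x):
    """Logistic function used by each hypothetical detector."""
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    """Numerically stable softmax, so the outputs sum to 1."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class Detector:
    """Hypothetical primary detector: one logistic unit over a feature subset."""

    def __init__(self, feature_idx, rng):
        self.feature_idx = feature_idx  # which input dimensions this detector reads
        self.weights = [rng.uniform(-1.0, 1.0) for _ in feature_idx]
        self.bias = 0.0

    def score(self, features):
        z = self.bias + sum(
            w * features[i] for w, i in zip(self.weights, self.feature_idx)
        )
        return sigmoid(z)


class MultinetSketch:
    """Detector layer followed by a combining stage (assumed: linear + softmax)."""

    def __init__(self, detector_subsets, n_classes, seed=0):
        rng = random.Random(seed)
        self.detectors = [Detector(idx, rng) for idx in detector_subsets]
        # One row of combining weights per class, one column per detector.
        self.comb_w = [
            [rng.uniform(-1.0, 1.0) for _ in self.detectors]
            for _ in range(n_classes)
        ]
        self.comb_b = [0.0] * n_classes

    def posteriors(self, features):
        # Stage 1: every detector scores its own slice of the input.
        det_out = [det.score(features) for det in self.detectors]
        # Stage 2: the combining stage turns detector outputs into class posteriors.
        scores = [
            b + sum(w * d for w, d in zip(row, det_out))
            for row, b in zip(self.comb_w, self.comb_b)
        ]
        return softmax(scores)


# Three classes, one detector per class, each reading two of six features.
net = MultinetSketch(detector_subsets=[[0, 1], [2, 3], [4, 5]], n_classes=3)
probs = net.posteriors([0.2, -0.1, 0.7, 0.4, -0.3, 0.9])
```

Because the weights here are random, `probs` is a valid probability vector but carries no meaning. In a modular design of this kind, a detector could in principle be retrained or replaced without touching the others, which is the property the abstract credits for the ease of successive improvements.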

Bibliographic reference. Pizzolato, Ednaldo B. / Reynolds, T. Jeff (1998): "Initial speech recognition results using the multinet architecture", in ICSLP-1998, paper 0821.