4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Parameter Tying for Flexible Speech Recognition

J. Simonin, S. Bodin, D. Jouvet, K. Bartkova

France Télécom - CNET - LAA/TSS/RCP, Technopole Anticipa, Lannion, France

This paper presents two parameter tying techniques that enable a trade-off between computational cost and recognition performance in a speaker-independent flexible speech recognition system operating over the telephone network. Parameter tying is conducted at the phonetic and acoustic levels. At the phonetic level, allophone-based and triphone-based phonetic modeling are used simultaneously to achieve the best trade-off between computational cost and recognition performance. Compared with allophone-only modeling, this decreases the error rate while keeping the computational cost under control. At the acoustic level, tying is performed by clustering the Gaussian densities of the mixture distributions. After clustering, a particular density may be used by several distributions. This allows the total number of Gaussian densities to be halved while improving recognition performance.
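The acoustic-level tying described above can be illustrated with a minimal sketch. The abstract does not state the clustering criterion, so everything here is an assumption made for illustration: diagonal-covariance Gaussians, a variance-normalised squared distance between means, greedy pairwise merging by moment matching, and the hypothetical names `distance`, `merge` and `tie_densities`. Mixture weights are ignored for brevity.

```python
from itertools import combinations


def distance(a, b):
    """Assumed measure: squared mean difference scaled by the average variance."""
    (ma, va), (mb, vb) = a, b
    return sum((x - y) ** 2 / (0.5 * (p + q))
               for x, y, p, q in zip(ma, mb, va, vb))


def merge(a, b, wa, wb):
    """Moment-matched merge of two diagonal Gaussians with counts wa and wb."""
    (ma, va), (mb, vb) = a, b
    w = wa + wb
    mean = [(wa * x + wb * y) / w for x, y in zip(ma, mb)]
    var = [(wa * (p + x * x) + wb * (q + y * y)) / w - m * m
           for x, y, p, q, m in zip(ma, mb, va, vb, mean)]
    return (mean, var)


def tie_densities(mixtures, target):
    """Greedily merge the closest densities until `target` remain.

    mixtures: list of mixtures, each a list of (mean, variance) pairs.
    Returns (shared, tied): the shared density pool and, for each mixture,
    the indices of the pool entries it uses.
    """
    pool = [g for mix in mixtures for g in mix]
    counts = [1] * len(pool)          # how many original densities each entry absorbed
    owner = list(range(len(pool)))    # pool entry each original density points to
    alive = set(range(len(pool)))
    while len(alive) > target:
        i, j = min(combinations(sorted(alive), 2),
                   key=lambda p: distance(pool[p[0]], pool[p[1]]))
        pool[i] = merge(pool[i], pool[j], counts[i], counts[j])
        counts[i] += counts[j]
        alive.remove(j)
        owner = [i if o == j else o for o in owner]
    # Renumber the surviving densities 0..target-1.
    remap = {old: new for new, old in enumerate(sorted(alive))}
    shared = [pool[old] for old in sorted(alive)]
    tied, k = [], 0
    for mix in mixtures:
        tied.append([remap[owner[k + n]] for n in range(len(mix))])
        k += len(mix)
    return shared, tied


if __name__ == "__main__":
    # Two one-dimensional mixtures with two densities each (toy values).
    mixtures = [[([0.0], [1.0]), ([5.0], [1.0])],
                [([0.1], [1.0]), ([9.0], [1.0])]]
    shared, tied = tie_densities(mixtures, target=3)
    print(len(shared), tied)
```

In this toy run the two densities centred near zero are merged, so both mixtures end up pointing at the same shared pool entry, which is the sense in which a density "may be used by several distributions". Setting `target` to half the original density count would mirror the reduction reported in the abstract, although the paper's actual criterion may differ.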

Bibliographic reference. Simonin, J. / Bodin, S. / Jouvet, D. / Bartkova, K. (1996): "Parameter tying for flexible speech recognition", in ICSLP-1996, 1089-1092.