Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Improving Recognition Performances on Field Data with an A-Priori Segmentation of the Speech Signal

T. Moudenc, D. Jouvet, J. Monné

France Telecom CNET, Technopole Anticipa, Lannion, France

This paper investigates the modelling of speech signal stationarity changes in order to improve both recognitions and rejection performances of an HMM based speech recognition system on field data, which also implies the rejection of noises and out-of-vocabulary utterances. Implemented in an N-best solution post-processing, the use of discrete probability distributions of the number of speech signal stationarity changes inside each phonetic segment enables the computation of a segmental postprocessing score. The experiments conducted on afield recording database resulted in a 6 % reduction in the substitution error rate on the correct and truncated utterances, and a 35 % reduction in the false alarm error rate on noises and out-of-vocabulary utterances. This yielded a 20 % global error rate reduction.

Full Paper

Bibliographic reference.  Moudenc, T. / Jouvet, D. / Monné, J. (1995): "Improving recognition performances on field data with an a-priori segmentation of the speech signal", In EUROSPEECH-1995, 1479-1482.