5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Dealing with Pronunciation Variants at the Language Model Level for the Continuous Automatic Speech Recognition of French

L. Pousse, Guy Pérennou

IRIT University Paul Sabatier, Toulouse, France

In this paper, we describe three approaches of continuous speech recognition. Two of them (referred to as (W,P) and (W',P) models) take into account pronunciation variants of words. They allow to handle (very common) phonological french phenomena like liaisons or mute-e elision. The (W',P) model introduces the phonotypical level as defined in the MHAT Model [4,5]. Comparing (W,P) and (W',P) models show a significant improvement in recognition accuracy when a contextual language model is introduced at this phonotypical level.

Full Paper

Bibliographic reference.  Pérennou, Guy / Pousse, L. (1997): "Dealing with pronunciation variants at the language model level for the continuous automatic speech recognition of French", In EUROSPEECH-1997, 2727-2730.