EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Thai Grapheme-To-Phoneme Using Probabilistic GLR Parser

Pongthai Tarsaku, Virach Sornlertlamvanich, Rachod Thongprasirt

NECTEC, Thailand

Many difficulties in the Thai language such as the absence of boundary word, linking syllables in pronunciation, and homographs are challenging us in developing a Thai Grapheme-to-Phoneme (G2P) converter. Presently there are a couple Thai G2P systems which are proposed in ruled-based and decision-tree approach. The rule-based approach has a drawback in the limitation of employing the context. The decision-tree approach is somehow able to capture the local context for making the decision. On the contrary, the Probabilistic Generalized LR (PGLR) approach is reported that both the global and local context are efficiently captured in the probabilistic model. In this paper, we implement a Thai G2P system based on the PGLR approach. The result of experiment shows 90.44% of word accuracy in case of ignoring vowels length and 72.87% of word accuracy in case of exact match evaluation. These results are superior to those of rule-based and decision-tree approaches.

Full Paper

Bibliographic reference.  Tarsaku, Pongthai / Sornlertlamvanich, Virach / Thongprasirt, Rachod (2001): "Thai grapheme-to-phoneme using probabilistic GLR parser", In EUROSPEECH-2001, 1057-1060.