Speech Prosody 2004

Nara, Japan
March 23-26, 2004

Vietnamese Tones Generation Using F0 and Power Patterns

Tu Trong Do, Tomio Takara

Department of Information Engineering, University of the Ryukyus, Japan

We propose a Vietnamese Text-To-Speech (VieTTS) system in which F0 and power patterns are used to generate Vietnamese tones precisely. Fundamental speech units of this system are demisyllables with Level tone. VieTTS uses a source-filter model for speech production and a Log Magnitude Approximation (LMA) filter as the vocal tract filter. We chose the Hanoi dialect for VieTTS. Tone synthesis of Vietnamese is implemented by using fundamental frequency (F0) patterns and power pattern control. F0 is the most important factor in Vietnamese tone synthesis and the power control strongly affects Broken and Drop tones. Applying power control for tone synthesis is effective and unique for Vietnamese compared to other tonal languages such as Chinese and Thai.

Full Paper

Bibliographic reference.  Do, Tu Trong / Takara, Tomio (2004): "Vietnamese tones generation using F0 and power patterns", In SP-2004, 471-474.