Sixth ISCA Workshop on Speech Synthesis

Bonn, Germany
August 22-24, 2007

Control Concepts for Articulatory Speech Synthesis

Peter Birkholz (1), Ingmar Steiner (2), Stefan Breuer (3)

(1) Institute for Computer Science, University of Rostock, Germany
(2) Department of Computational Linguistics and Phonetics, Saarland University, Germany
(3) Institute of Communication Sciences (IfK), University of Bonn, Germany

We present two concepts for generating gestural scores to control an articulatory speech synthesizer. A gestural score is an organized pattern of articulatory gestures and serves as the common input to the synthesizer. The first concept generates the gestures for an utterance from the phonetic transcription, phone durations, and intonation commands that the Bonn Open Synthesis System (BOSS) predicts from arbitrary input text, which extends the synthesizer into a text-to-speech system. The second concept generates the articulatory gestures from timing information extracted from electromagnetic articulography signals, and thus serves the re-synthesis of natural utterances. Finally, we discuss application prospects for the presented synthesizer.


Bibliographic reference. Birkholz, Peter / Steiner, Ingmar / Breuer, Stefan (2007): "Control concepts for articulatory speech synthesis", in SSW6-2007, 5-10.