Speech Prosody 2006

Dresden, Germany
May 2-5, 2006

Measuring and Modeling Audiovisual Prosody for Animated Agents

Björn Granström, David House

Department of Speech, Music and Hearing, Centre for Speech Technology (CTT), KTH, Stockholm, Sweden

Understanding the interactions between visual expressions, dialogue functions and the acoustics of the corresponding speech presents a substantial challenge. Much of our work in this area aims to create an animated talking agent that displays realistic communicative behavior and is suitable for use in conversational spoken language systems, e.g. a virtual language teacher. In this presentation we give examples of recent work, primarily at KTH, involving the collection and analysis of a database for audiovisual prosody. We report on methods for the acquisition and modeling of visual and acoustic data, and provide examples of the analysis of head nods and eyebrow settings.

Bibliographic reference. Granström, Björn / House, David (2006): "Measuring and modeling audiovisual prosody for animated agents", in SP-2006, paper 117.