Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

Prosody, Supporting Real-Time Conversation

Hiroki Oohashi (1), Tomoko Ohsuga (2), Yasuo Horiuchi (3), Hideaki Kikuchi (1), Akira Ichikawa (1)

(1) Faculty of Human Science, Waseda University, Tokyo, Japan
(2) National Institute of Informatics, Japan
(3) Graduate School of Advanced Integration Science Chiba University, Japan

We assume that prosody contains information forenoticing segment boundaries, syntactic structures, and turn transitions and enable us to predict these more easily. We examined this assumption using the F0 model. Concretely speaking, as for forenotices, we examined whether or not the F0 model parameters can lead to segment boundaries, dependencies of phrases, and turn transitions. On the other hand, as for predictions, we conducted cognitive experiments on turn-taking by presenting stimulations containing only prosody and not phonological information. As a result, the segment boundaries were exactly forenoticed at an accuracy of about 60%, the dependencies of phrases were done at about 80%, the turn transitions were done at about 70%, and the possibility of predictions about turn transitions was indicated.

Index Terms: real-time conversation, word segmentation, turntaking, syntactic structure, F0 model

Full Paper

Bibliographic reference.  Oohashi, Hiroki / Ohsuga, Tomoko / Horiuchi, Yasuo / Kikuchi, Hideaki / Ichikawa, Akira (2010): "Prosody, supporting real-time conversation", In SP-2010, paper 095.