DiSS-LPSS Joint Workshop 2010

The 5th Workshop on Disfluency in Spontaneous Speech
The 2nd International Symposium on Linguistic Patterns in Spontaneous Speech

Tokyo, Japan, September 25-26, 2010

Analysis of Prosodic Features for End-of-Utterance Prediction in Spontaneous Japanese

Yuichi Ishimoto (1), Mika Enomoto (2)

(1) Speech Media Group, National Institute of Informatics, Japan
(2) School of Media Science, Tokyo University of Technology, Japan

In this study, we analyzed prosodic features of accentual phrases and investigated their temporal changes to obtain cues for detecting boundaries where turn-taking could occur in spontaneous conversations. The acoustic parameters used as prosodic features were the fundamental frequency, sound pressure level, and duration of accentual phrases in long utterance units. The results showed that the fundamental frequency shift between the first and second accentual phrases could be useful for estimating the number of accentual phrases in the long utterance unit. In addition, the results suggested that a rapid decrease in sound pressure and an extended duration of the accentual phrase constitute a cue for detecting the end of the utterance. That is, the acoustic predictor of the utterance length appeared at the beginning of the utterance, and the predictor of the utterance boundary appeared shortly before the end of the utterance.
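The two cues named in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the `AccentualPhrase` record, the function names, the comparison of each phrase with the one before it, and both threshold values are assumptions made purely for illustration, and the abstract does not report any numeric thresholds.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class AccentualPhrase:
    """Hypothetical per-phrase summary of the three prosodic features."""
    mean_f0_hz: float    # fundamental frequency, in Hz
    mean_spl_db: float   # sound pressure level, in dB
    duration_s: float    # phrase duration, in seconds


def f0_shift_semitones(phrases: List[AccentualPhrase]) -> float:
    """F0 change from the first to the second accentual phrase, in semitones."""
    if len(phrases) < 2:
        raise ValueError("need at least two accentual phrases")
    return 12.0 * math.log2(phrases[1].mean_f0_hz / phrases[0].mean_f0_hz)


def end_cue_indices(
    phrases: List[AccentualPhrase],
    spl_drop_db: float = 3.0,     # assumed threshold, not from the paper
    duration_ratio: float = 1.3,  # assumed threshold, not from the paper
) -> List[int]:
    """Indices of phrases that are both quieter and longer than the previous one."""
    cues = []
    for i in range(1, len(phrases)):
        prev, cur = phrases[i - 1], phrases[i]
        spl_drop = prev.mean_spl_db - cur.mean_spl_db
        lengthening = cur.duration_s / prev.duration_s
        if spl_drop >= spl_drop_db and lengthening >= duration_ratio:
            cues.append(i)
    return cues


# Made-up example values, chosen only to exercise the two functions.
utterance = [
    AccentualPhrase(mean_f0_hz=200.0, mean_spl_db=65.0, duration_s=0.5),
    AccentualPhrase(mean_f0_hz=100.0, mean_spl_db=64.0, duration_s=0.5),
    AccentualPhrase(mean_f0_hz=150.0, mean_spl_db=60.0, duration_s=0.7),
]
shift = f0_shift_semitones(utterance)
cues = end_cue_indices(utterance)
```

The first function corresponds to the cue available at the beginning of the utterance, and the second to the cue that appears shortly before its end; in practice the thresholds would have to be estimated from annotated data.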

Index Terms. prosody, turn-taking, accentual phrase, long utterance unit


Bibliographic reference. Ishimoto, Yuichi / Enomoto, Mika (2010): "Analysis of prosodic features for end-of-utterance prediction in spontaneous Japanese", In DiSS-LPSS-2010, 97-100.