Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

Acoustic Cues for Automatic Determination of Phrasing

Agnieszka Wagner

Department of Phonetics, Adam Mickiewicz University in Poznañ, Poland

This paper proposes a framework of automatic determination of phrasing using acoustic features derived from the speech signal. The feature vectors were defined in a series of analyses investigating the acoustic-phonetic realization of minor and major phrase boundaries and different boundary types. The resulting representation was used to train statistical classifiers to automatically determine phrase boundary position and type. The output of the classifiers can be used to provide speech corpora with phrasing information to enhance the performance of TTS or ASR systems, or to generate a comprehensive feedback in prosody tutoring systems. Apart from providing an efficient means for automatic phrase boundary detection, the study presented in this paper sheds also light on the role of timing and F0 cues in signaling phrase boundaries.

Index Terms: phrasing, boundary tones, prosody labeling

Full Paper

Bibliographic reference.  Wagner, Agnieszka (2010): "Acoustic cues for automatic determination of phrasing", In SP-2010, paper 196.