Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

The qTA Toolkit for Prosody: Learning Underlying Parameters of Communicative Functions through Modeling

Santitham Prom-on (1), Yi Xu (2)

(1) Department of Computer Engineering, King Mongkut's University of Technology Thonburi, Thailand
(2) Department of Speech, Hearing, and Phonetic Sciences, University College London, UK

This paper presents the qTA toolkit, a general-purpose research toolkit for studying speech prosody. The toolkit consists of an analysis tool and a visualization tool. The analysis tool processes F0 and timing data together with annotations of communicative functions to estimate function-specific underlying pitch targets and their function-specific adjustments. The visualization tool generates illustrations of the synthesized F0 contours and their pitch target inputs. As an initial test, the qTA toolkit is applied to a Mandarin corpus, and the results suggest that it can be used effectively for investigating prosody in terms of communicative functions.
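To make the notion of an underlying pitch target concrete, the following is a minimal sketch of how a single syllable's F0 contour might be synthesized from one target under the quantitative target approximation (qTA) model. It is not taken from this abstract or from the toolkit's code. It assumes the commonly published qTA formulation: a linear target x(t) = m*t + b approached through a third-order critically damped response with rate lam. The function names, parameter names, and numeric values are hypothetical and chosen only for illustration.

```python
import math


def qta_coefficients(m, b, lam, y0, v0, a0):
    """Transient coefficients from the initial F0 state (assumed qTA form).

    m, b : slope and height of the linear pitch target x(t) = m*t + b
    lam  : rate of target approximation (larger means faster approach)
    y0, v0, a0 : F0 value, velocity, and acceleration at syllable onset
    """
    c1 = y0 - b
    c2 = v0 + c1 * lam - m
    c3 = (a0 + 2.0 * c2 * lam - c1 * lam ** 2) / 2.0
    return c1, c2, c3


def qta_f0(t, m, b, lam, y0, v0=0.0, a0=0.0):
    """F0 at time t (seconds from syllable onset) under the assumed model."""
    c1, c2, c3 = qta_coefficients(m, b, lam, y0, v0, a0)
    target = m * t + b
    transient = (c1 + c2 * t + c3 * t * t) * math.exp(-lam * t)
    return target + transient


# Hypothetical example: a static target at 100 (arbitrary pitch units),
# approached from an onset value of 90 over a 200 ms syllable.
contour = [qta_f0(i * 0.01, m=0.0, b=100.0, lam=40.0, y0=90.0) for i in range(21)]
```

In this sketch the contour starts exactly at the onset value and moves toward the target at a speed set by lam. An analysis tool of the kind described above would work in the opposite direction, searching for the m, b, and lam values that best reproduce an observed contour.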

Index Terms: quantitative target approximation, qTA model, pitch target, communicative function, research toolkit

Bibliographic reference. Prom-on, Santitham / Xu, Yi (2010): "The qTA toolkit for prosody: learning underlying parameters of communicative functions through modeling", in SP-2010, paper 034.