Speech Prosody 2006

Dresden, Germany
May 2-5, 2006

Decomposition of Pitch Curves in the General Superpositional Intonation Model

Taniya Mishra, Jan van Santen, Esther Klabbers

Center for Spoken Language Understanding, OGI School of Science & Engineering, Oregon Health & Science University, Beaverton, OR, USA

This paper describes and applies a new algorithm for decomposing pitch curves into component curves, in accordance with the General Superpositional Model of Intonation. According to this model, which is a generalization of the Fujisaki model [3], a pitch contour can be described as the sum of component curves that are each associated with different phonological levels, including the phrase, foot, and phoneme. The algorithm assumes that the phrase curve is locally linear during intervals spanned by a foot. The algorithm was evaluated using synthetically generated curves, and was found to accurately recover the synthetic component curves. The algorithm was also evaluated in a perceptual experiment, where speech generated by concatenation of accent curves was shown to produce better speech quality than speech based on direct concatenation of "raw" pitch curve fragments.

Full Paper

Bibliographic reference.  Mishra, Taniya / Santen, Jan van / Klabbers, Esther (2006): "Decomposition of pitch curves in the general superpositional intonation model", In SP-2006, paper 108.