Intonation: Theory, Models, and Applications
A new approach for analyzing, coding, predicting and generating F0 curves based on linear predictive coding  is presented. The suggested model supports an automatic extraction of different parameters and accents from intonational contours by means of an appropriately simplified rectangular residual function and typical filter coefficients. The signal approximation can be regarded as an excitation signal for prosodical accents and other gestures, as a result of the pragmatical, semantical, syntactical and syllabical structure of the sentence. The resulting filter coefficients can be interpreted either as physiological parameters  or as speaker-individual features in the intonation shape. The method was developed for both practical prosodical data analysis and contour resynthesis in TTS or dialog machines. Perceptual evaluation also indicated first success for invariant filter structures and highly simplified residual signals. In this first approach the model is perceptually motivated  and directed towards the intonational gestalt on the sentence or phrase level.
Bibliographic reference. Mersdorf, Joachim / Rinscheid, Ansgar / Brüggen, Marc / Schmidt, Kai Uwe (1997): "Coding of large intonational units by linear prediction", In INT-1997, 235-238.