![]() |
Intonation: Theory, Models, and ApplicationsAthens, Greece |
![]() |
In our research on the prosody of Finnish we have modeled word-level prosody with artificial neural networks and a comparable model based on multiple linear regression. In earlier work [6] we have used multi-layer feed-forward networks to estimate prosodic parameters - segmental durations, loudness and pitch values - for single phones within words. The performance of these models in predicting average values for phones was encouraging. Nevertheless, a more detailed account of the behavior of the continuous prosodic parameters is needed, e.g., to enhance the intelligibility and naturalness of speech synthesis. We present results for both models' ability to estimate microprosodic variation of pitch in Finnish.
Bibliographic reference. Vainio, M. / Altosaar, Toomas / Karjalainen, Marti / Aulanko, R. (1997): "Modeling Finnish microprosody for speech synthesis", In INT-1997, 309-312.