Third International Conference on Spoken Language Processing (ICSLP 94)
This paper describes a method for generating intonation events and prosodic phrasing from a high level linguistic description. Specifically, the input consists of information normally available from linguistic processing: part of speech, constituent structure and, importantly, speech act. The generated output contains explicit intonation events from which an Fo contour may be generated. Prosody can be controlled via features in the input describing the function of words and phrases without direct reference to intonation. The results are evaluated against natural spoken sentences.
Bibliographic reference. Black, Alan W. / Taylor, Paul (1994): "Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input", In ICSLP-1994, 715-728.