Speech Prosody 2004
Forty-four stimuli were made from the unemotional utterance "eh" with duration changes (4 levels) and range of F0 (11 levels). Ten adult participants were asked to judge if the stimuli were congruent with the contexts (disagreement, hesitation, and agreement). Stimuli with rising tones tended to be identified as "surprise." On the other hand, stimuli with falling tones were identified as "postponement" when their duration was long, and were identified as "affirmation" when their duration was short. The results indicated that the duration and the ranges of F0 should be effective in identifying the contexts in which they were spoken.
Bibliographic reference. Nagasaki, Yasuko / Komatsu, Takanori (2004): "Can people perceive different emotions from a non-emotional voice by modifying its F0 and duration?", In SP-2004, 667-670.