5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech

Akemi Iida (1), Nick Campbell (2), Soichiro Iga (1), Fumito Higuchi (1), Michiaki Yasumura (1)

(1) Graduate School of Media and Governance, Keio University, Japan
(2) ATR Interpreting Telecommunications Research Laboratories, Japan

This paper proposes three corpora of emotional speech in Japanese, each designed to maximize the expression of one emotion (joy, anger, or sadness), for use with CHATR, the concatenative speech synthesis system being developed at ATR. A perceptual experiment was conducted using speech synthesized from each emotion corpus, and the intended emotions proved to be significantly identifiable. The authors' current work is to identify the local acoustic features relevant for specifying a particular emotion type. F0 and duration showed significant differences among emotion types. AV (amplitude of voicing source) and GN (glottal noise) also showed differences. The paper reports on the corpus design, the perceptual experiment, and the results of the acoustic analysis.

Bibliographic reference. Iida, Akemi / Campbell, Nick / Iga, Soichiro / Higuchi, Fumito / Yasumura, Michiaki (1998): "Acoustic nature and perceptual testing of corpora of emotional speech", in ICSLP-1998, paper 0818.