Speech Prosody 2008

Campinas, Brazil
May 6-9, 2008

How We are not Equally Competent for Discriminating Acted from Spontaneous Expressive Speech

Nicolas Audibert (1), Véronique Aubergé (1), Albert Rilliard (2)

(1) Gipsa-lab Speech & Cognition Dept (Institut de la Communication Parlée), CNRS UMR 5216/Université Stendhal, Grenoble, France
(2) LIMSI-CNRS, Orsay, France

This paper reports how acted vs. spontaneous expressive speech can be discriminated by human listeners, with various performances depending on the listener (in line with preliminary results for amusement by [3]). The perceptive material was taken from the Sound Teacher/E-Wiz corpus [1], for 4 French-speaking actors trapped in spontaneous expressive monoword utterances, and then acting immediately after, in an acting protocol supposed to be a very convenient for them. Pairs of acted vs. spontaneous stimuli, expressing affective states related to anxiety, irritation and satisfaction, were rated by 33 native French listeners in audio-only, visualonly and audiovisual conditions. In visual-only condition, 70% of listeners were able to identify acted vs. spontaneous pairs over chance level, for 78% in audio-only condition and up to 85% in audio-visual condition. Globally, a highly significant subject effect confirms the hypothesis of a varied affective competence for separating involuntary vs. simulated affects [2]. One feature used by listeners in the acoustic task of discrimination can be the perceived emotional intensity, in accordance with the measurement of this intensity level for the same stimuli from a previous perception experiment by Laukka and al. [4].

References

  1. Aubergé, V.; Audibert, N.; Rilliard, A., 2004. E-Wiz: A Trapper Protocol for Hunting the Expressive Speech Corpora in Lab. 4th LREC, Lisbon, Portugal, 179-182.
  2. Aubergé V. 2002. A Gestalt morphology of prosody directed by functions: the example of a step by step model developed at ICP. 1st Speech Prosody, Aix-en- Provence, France, 151-155.
  3. Aubergé, V.; Cathiard, M., 2003. Can we hear the prosody of smile? Speech Communication 40, Special issue on Emotional Speech, 87-97.
  4. Laukka, P.; Audibert, N.; Aubergé, V., 2007. Graded structure in vocal expression of emotion: What is meant by "prototypical expressions"? WS Paralinguistic Speech - between models and data, Saarbrücken, Germany, 1-4.

Full Paper

Bibliographic reference.  Audibert, Nicolas / Aubergé, Véronique / Rilliard, Albert (2008): "How we are not equally competent for discriminating acted from spontaneous expressive speech", In SP-2008, 693-696.