13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Combining Ranking and Classification to Improve Emotion Recognition in Spontaneous Speech

Houwei Cao (1), Ragini Verma (1), Ani Nenkova (2)

(1) Department of Radiology, Section of Biomedical Image Analysis
(2) Department of Computer and Information Science
University of Pennsylvania, USA

We introduce a novel emotion recognition approach which integrates ranking models. The approach is speaker independent, yet it is designed to exploit information from utterances from the same speaker in the test set before making predictions. In this manner, it achieves much higher precision in identifying emotional utterances than a conventional SVM classifier. Furthermore we test several possibilities for combining conventional classification and predictions based on ranking. All combinations improve overall prediction accuracy. All experiments are performed on the FAU AIBO database which contains realistic spontaneous emotional speech. Our best combination system achieves 6.6% absolute improvement over the Interspeech 2009 emotion challenge baseline system on the 5-class classification tasks.

Index Terms: emotion classification, ranking models, spontaneous speech

Full Paper

Bibliographic reference.  Cao, Houwei / Verma, Ragini / Nenkova, Ani (2012): "Combining ranking and classification to improve emotion recognition in spontaneous speech", In INTERSPEECH-2012, 358-361.