EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Speech Emotion Recognition Using Hidden Markov Models

Albino Nogueiras, Asunción Moreno, Antonio Bonafonte, José B. Mariño

Research Center TALP UPC, Spain

This paper introduces a first approach to emotion recognition using RAMSES, the UPC's speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. Both the selection of low level features and the design of the recognition system are addressed. Results are given on speaker dependent emotion recognition using the Spanish corpus of INTERFACE emotional speech synthesis database. The accuracy recognising seven different emotions---the six ones defined in MPEG-4 plus neutral style---exceeds 80% using the best combination of low level features and HMM structure. This result is very similar to that obtained with the same database in subjective evaluation by human judges.

Full Paper

Bibliographic reference.  Nogueiras, Albino / Moreno, Asunción / Bonafonte, Antonio / Mariño, José B. (2001): "Speech emotion recognition using hidden Markov models", In EUROSPEECH-2001, 2679-2682.