EUROSPEECH 2001 Scandinavia
This paper introduces a first approach to emotion recognition using RAMSES, the UPC's speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. Both the selection of low level features and the design of the recognition system are addressed. Results are given on speaker dependent emotion recognition using the Spanish corpus of INTERFACE emotional speech synthesis database. The accuracy recognising seven different emotions---the six ones defined in MPEG-4 plus neutral style---exceeds 80% using the best combination of low level features and HMM structure. This result is very similar to that obtained with the same database in subjective evaluation by human judges.
Bibliographic reference. Nogueiras, Albino / Moreno, Asunción / Bonafonte, Antonio / Mariño, José B. (2001): "Speech emotion recognition using hidden Markov models", In EUROSPEECH-2001, 2679-2682.