Third International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2003)

Florence, Italy
December 10-12, 2003

Automatic Detection of Stress in Speech

H. J. Fell (1), Joel MacAuslan (2)

(1) College of Computer and Information Science, Northeastern University, MA, USA
(2) Speech Technology and Applied Research, Lexington, MA, USA

We have developed software based on the Stevens landmark theory to extract features from utterances, both within and adjacent to voiced regions. We then apply two statistical methods, closest-match (CM) and principal components analysis (PCA), to these features to classify utterances according to their emotional content. Using a subset of samples from the Actual Stress portion of the SUSAS database as a reference set, we automatically classify the emotional state of other samples with 75% accuracy, using CM either alone or in combination with PCA. The accuracy does not appear to depend strongly on measurement errors or other small details of the present data, which gives some confidence that the results will carry over to other data.
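As a rough illustration of the classification step only, the sketch below projects feature vectors onto a few principal components and then assigns each test sample the label of its closest reference sample. It is not the authors' implementation: the abstract does not define closest-match, so a 1-nearest-neighbor rule with Euclidean distance is assumed here. The function names and the synthetic six-dimensional features are hypothetical stand-ins and have nothing to do with landmark features or the SUSAS data.

```python
import numpy as np


def fit_pca(X, n_components):
    """Return the mean and the top principal directions of X (rows = samples)."""
    mean = X.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def project(X, mean, components):
    """Project samples onto the retained principal directions."""
    return (X - mean) @ components.T


def closest_match(reference, labels, queries):
    """Label each query with the label of its nearest reference sample.

    Assumption: 'closest' means smallest Euclidean distance.
    """
    diffs = queries[:, None, :] - reference[None, :, :]
    dists = np.linalg.norm(diffs, axis=2)
    return [labels[i] for i in dists.argmin(axis=1)]


# Synthetic stand-in data: two well-separated clusters of 6-dimensional features.
rng = np.random.default_rng(0)
reference = np.vstack([
    rng.normal(0.0, 0.5, size=(20, 6)),
    rng.normal(3.0, 0.5, size=(20, 6)),
])
labels = ["low"] * 20 + ["high"] * 20
queries = np.vstack([
    rng.normal(0.0, 0.5, size=(5, 6)),
    rng.normal(3.0, 0.5, size=(5, 6)),
])

# PCA is fitted on the reference set only, then applied to both sets.
mean, components = fit_pca(reference, n_components=2)
predicted = closest_match(
    project(reference, mean, components),
    labels,
    project(queries, mean, components),
)
print(predicted)
```

Skipping the projection and calling `closest_match` on the raw feature vectors would correspond to using CM alone, under the same assumptions.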

Index Terms. automatic detection, emotion, speech, stress

Full Paper (reprinted with permission from Firenze University Press)

Bibliographic reference. Fell, H. J. / MacAuslan, Joel (2003): "Automatic detection of stress in speech", In MAVEBA-2003, 9-12.