14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Extended Weighted Linear Prediction Using the Autocorrelation Snapshot — A Robust Speech Analysis Method and its Application to Recognition of Vocal Emotions

Jouni Pohjalainen, Paavo Alku

Aalto University, Finland

Temporally weighted linear predictive methods have recently been successfully used for robust feature extraction in speech and speaker recognition. This paper introduces their general formulation, where various efficient temporal weighting functions can be included in the optimization of the all-pole coefficients of a linear predictive model. Temporal weighting is imposed by multiplying elements of instantaneous autocorrelation "snapshot" matrices computed from speech data. With this novel autocorrelation-snapshot formulation of weighted linear prediction, it is demonstrated that different temporal aspects of speech can be emphasized in order to enhance robustness of feature extraction in speech emotion recognition.

Full Paper

Bibliographic reference.  Pohjalainen, Jouni / Alku, Paavo (2013): "Extended weighted linear prediction using the autocorrelation snapshot — a robust speech analysis method and its application to recognition of vocal emotions", In INTERSPEECH-2013, 1931-1935.