INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

A New Prosody Annotation Protocol for Live Sports Commentaries

Sandrine Brognaux (1), Benjamin Picart (2), Thomas Drugman (2)

(1) Katholieke Universiteit Leuven, Belgium
(2) Université de Mons, Belgium

This paper proposes a new prosody annotation protocol specific to live sports commentaries. Two levels of annotation are defined with HMM-based speech synthesis in view. Local labels are assigned to all syllables and refer to accentual phenomena. Global labels classify sequences of words into five distinct sub-genres, defined in terms of valence and arousal. The objective of the study is to provide a set of labels both related to a specific function and characterized by a distinct acoustic realization. The consideration of these constraints should allow for an automatic prediction of the labels both from the text or from the speech signal. Reasonable inter-annotator scores are achieved for both annotation levels. A prosodic analysis of all labels also shows that they can usually be distinguished by specific acoustic realizations. The integration of this new annotation protocol within HMM-based speech synthesis shows promising results.

Full Paper

Bibliographic reference.  Brognaux, Sandrine / Picart, Benjamin / Drugman, Thomas (2013): "A new prosody annotation protocol for live sports commentaries", In INTERSPEECH-2013, 1554-1558.