ISCA Archive Interspeech 2013

A new prosody annotation protocol for live sports commentaries

Sandrine Brognaux, Benjamin Picart, Thomas Drugman

This paper proposes a new prosody annotation protocol specific to live sports commentaries. Two levels of annotation are defined with HMM-based speech synthesis in view. Local labels are assigned to all syllables and refer to accentual phenomena. Global labels classify sequences of words into five distinct sub-genres, defined in terms of valence and arousal. The objective of the study is to provide a set of labels both related to a specific function and characterized by a distinct acoustic realization. The consideration of these constraints should allow for an automatic prediction of the labels either from the text or from the speech signal. Reasonable inter-annotator scores are achieved for both annotation levels. A prosodic analysis of all labels also shows that they can usually be distinguished by specific acoustic realizations. The integration of this new annotation protocol within HMM-based speech synthesis shows promising results.

doi: 10.21437/Interspeech.2013-393

Cite as: Brognaux, S., Picart, B., Drugman, T. (2013) A new prosody annotation protocol for live sports commentaries. Proc. Interspeech 2013, 1554-1558, doi: 10.21437/Interspeech.2013-393

  author={Sandrine Brognaux and Benjamin Picart and Thomas Drugman},
  title={{A new prosody annotation protocol for live sports commentaries}},
  booktitle={Proc. Interspeech 2013},