INTERSPEECH 2011
12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Towards a Versatile Multi-Layered Description of Speech Corpora Using Algebraic Relations

Nelly Barbot, Vincent Barreaud, Olivier Boëffard, Laure Charonnat, Arnaud Delhay, Sébastien Le Maguer, Damien Lolive

IRISA, France

This paper presents a software library, namely ROOTS for Rich Object Oriented Transcription System, that helps to describe spoken messages in a coherent manner linking sequences of items on numerous levels (linguistic, phonological, or acoustic). The proposed representation is incremental and can thus describe any or all parts of an utterance. In order to link different levels of description, algebraic relations are used. Instead of relying solely on fixed, pre-determined relations, algebraic composition operators are proposed that can create a missing relation on demand. In terms of software architecture, object classes are defined based on a well-grounded theoretical representation of speech (text, syntax, phonology and acoustics), without particular dependences on an annotation system (e.g. IPA is fully implemented). The API documentation for this software is available online [7].

Reference

  1. ROOTS homepage, http://www.irisa.fr/cordial/roots, 2011.

Full Paper

Bibliographic reference.  Barbot, Nelly / Barreaud, Vincent / Boëffard, Olivier / Charonnat, Laure / Delhay, Arnaud / Maguer, Sébastien Le / Lolive, Damien (2011): "Towards a versatile multi-layered description of speech corpora using algebraic relations", In INTERSPEECH-2011, 1501-1504.