This paper presents a software library, namely ROOTS for Rich Object Oriented Transcription System, that helps to describe spoken messages in a coherent manner linking sequences of items on numerous levels (linguistic, phonological, or acoustic). The proposed representation is incremental and can thus describe any or all parts of an utterance. In order to link different levels of description, algebraic relations are used. Instead of relying solely on fixed, pre-determined relations, algebraic composition operators are proposed that can create a missing relation on demand. In terms of software architecture, object classes are defined based on a well-grounded theoretical representation of speech (text, syntax, phonology and acoustics), without particular dependences on an annotation system (e.g. IPA is fully implemented). The API documentation for this software is available online .
Bibliographic reference. Barbot, Nelly / Barreaud, Vincent / Boëffard, Olivier / Charonnat, Laure / Delhay, Arnaud / Maguer, Sébastien Le / Lolive, Damien (2011): "Towards a versatile multi-layered description of speech corpora using algebraic relations", In INTERSPEECH-2011, 1501-1504.