EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Three-Dimensional Modelling of Speech Corpora: Added Value through Visualisation

Toomas Altosaar (1), Matti Karjalainen (1), Martti Vainio (2)

(1) Helsinki University of Technology, Finland
(2) University of Helsinki, Finland

Collections of annotated spoken language have formed an important basis for the development of speech technology. Their existence has promoted speech analysis research and has enabled robust synthesis and recognition methods to be developed. However, many complex relationships remain unspecified within a corpus because of a lack of metadata that describes, in sufficient detail, the raw information as well as the inter-relationships between signals, recording conditions, talkers, etc. A lack of the standards and formats needed to express complex relationships has also hindered the potential use and value of available corpora. This paper presents a novel three-dimensional model for exploring temporal as well as atemporal information existing in speech corpora. It examines the potential benefits gained through corpus visualisation during the phases of creation, editing, verification, use, and exploration. The paper suggests that by providing a three-dimensional model of speech data, more of the inherent and potential value of a corpus can be utilised.

Bibliographic reference. Altosaar, Toomas / Karjalainen, Matti / Vainio, Martti (2001): "Three-dimensional modelling of speech corpora: added value through visualisation", in EUROSPEECH-2001, 1537-1540.