EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Creating a European English Broadcast News Transcription Corpus and System

Gerhard Backfried, Robert Hecht, Sabine Loots, Norbert Pfannerer, Jürgen Riedler, Christian Schiefer

Sail Labs, Austria

We describe the Sail-Labs Media Indexer system, which is based on BBN's Rough'n'Ready suite of technologies used in the DARPA Hub-4 evaluations and is designed to process European English television broadcasts. We discuss the development of a European English broadcast news corpus suitable for measuring the performance of system components such as speaker identification and speech recognition. We further report evaluation results on our multi-purpose test set and outline the integration of real-time indexing into a spoken document retrieval system.


Bibliographic reference. Backfried, Gerhard / Hecht, Robert / Loots, Sabine / Pfannerer, Norbert / Riedler, Jürgen / Schiefer, Christian (2001): "Creating a European English broadcast news transcription corpus and system", In EUROSPEECH-2001, 2039-2042.