First Workshop on Speech, Language and Audio in Multimedia (SLAM 2013)

Marseille, France
August 22-23, 2013

Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts

Christian Mohr, Christian Saam, Kevin Kilgour, Jonas Gehring, Sebastian Stüker, Alex Waibel

International Center for Advanced Communication Technologies (interACT), Institute for Anthropomatics, Karlsruhe Institute of Technology, Karlsruhe, Germany

In this paper we investigate the exploitation of loosely transcribed audio data, in the form of captions for weather forecast recordings, in order to adapt acoustic models for automatically transcribing these kinds of forecasts. We focus on dealing with inaccurate time stamps in the captions and the fact that they often deviate from the exact spoken word sequence in the forecasts. Furthermore, different adaptation algorithms are compared when incrementally increasing the amount of adaptation material, for example, by recording new forecasts on a daily basis.

Index Terms: speech recognition, acoustic model adaptation, slightly supervised training, loose transcripts, adaptation methods

Full Paper

Bibliographic reference.  Mohr, Christian / Saam, Christian / Kilgour, Kevin / Gehring, Jonas / Stüker, Sebastian / Waibel, Alex (2013): "Slightly supervised adaptation of acoustic models on captioned BBC weather forecasts", In SLAM-2013, 32-36.