4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Transcribing Radio News

Francis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard Schwartz

BBN Systems and Technologies, Cambridge, MA, USA

We have recently extended the capabilities of BBN's large vocabulary discrete-utterance speech recognition system (BYBLOS) to operate on raw audio recordings of radio news programming. The recordings are given to the system as large monolithic waveforms without any side information. Our goal is to transcribe all speech in the input with the highest accuracy possible. The problem is very challenging because radio news programming has frequent changes in speaker, speaking style, dialect, accent, topic, channel, and environmental conditions. Furthermore, the monolithic input presents new problems for recognition algorithms and language models, since all useful boundaries (such as speaker turns or sentence ends) are unknown.

Bibliographic reference. Kubala, Francis / Anastasakos, Tasos / Jin, Hubert / Nguyen, Long / Schwartz, Richard (1996): "Transcribing radio news", In ICSLP-1996, 598-601.