5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Toward Automatic Transcription of Japanese Broadcast News

Tatsuo Matsuok (1), Yuichi Taguchi (2), Katsutoshi Ohtsuki (1), Sadaoki Furui (3), Katsuhiko Shirai (2)

(1) NTT Human Interface Laboratories, Yokosuka-shi, Kanagawa, Japan
(2) Waseda University, Tokyo, Japan
(3) Tokyo Institute of Technology, Japan

In this paper, we report on the automatic recognition of Japanese broadcast-news speech. We have been working on large-vocabulary continuous speech recognition (LVCSR) for Japanese newspaper speech transcription and have achieved good performance. We have recently applied our LVCSR system to transcribing Japanese broadcast-news speech. We extended the vocabulary from 7k words to 20k words and trained the language models using newspaper texts and broadcast-news manuscripts. These two language models were applied to our evaluation speech sets. The language model trained using broadcast-news manuscripts achieved better results for broadcast-news speech than the language model trained using newspaper texts, which achieved better results for newspaper speech. We achieved a word error rate of 19.7% for anchor-speaker's speech by using a bigram language model and a trigram language model both trained using broadcast-news manuscripts.

Full Paper

Bibliographic reference.  Matsuok, Tatsuo / Taguchi, Yuichi / Ohtsuki, Katsutoshi / Furui, Sadaoki / Shirai, Katsuhiko (1997): "Toward automatic transcription of Japanese broadcast news", In EUROSPEECH-1997, 915-918.