Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

On Inter-Phrase Context Dependencies in Continuously Read Japanese Speech

Kazuhiro Kondo (1), Yu-Hung Kao (2), Barbara Wheatley (2)

(1) Texas Instruments, Tsukuba Research and Development Center, Tsukuba, Ibaraki, Japan
(2) Texas Instruments, Systems and Information Science Laboratory, Dallas, Texas, USA

This paper investigates methods for modeling inter-phrase or inter-word context in continuous Japanese speech recognition. We found that compiling a network of context-dependent phonetic models that captures the inter-word or inter-phrase context reduces recognition errors by 32% compared with models that do not account for inter-word context. However, this significantly increases the number of phonetic models required to cover the vocabulary. To limit this increase, we clustered the inter-word/phrase contexts into only a few classes. With one class for consonant inter-word contexts and two classes for vowel contexts, the accuracy on digit string recognition was virtually equal to that of the unclustered models, while the number of phonetic models required was reduced by more than 50%.
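
As a rough illustration of why clustering the inter-word context shrinks the model inventory, the following sketch counts word-boundary models for a toy digit lexicon, first with every distinct neighboring phone as its own context and then with contexts mapped to one consonant class and two vowel classes. Everything in it is an assumption made for illustration: the romanized phone sequences, the particular split of vowels into two classes, the `sil` context, and the choice of word-dependent boundary models are hypothetical and are not taken from the paper. The counts it produces describe only this toy setup, not the paper's results.

```python
# Toy sketch: count word-boundary context-dependent models with and
# without clustering of the inter-word context. All data is hypothetical.

# Hypothetical romanized phone sequences for ten Japanese digits
# (illustrative only; not the paper's lexicon or phone set).
LEXICON = {
    "ichi": ["i", "ch", "i"],
    "ni": ["n", "i"],
    "san": ["s", "a", "N"],
    "yon": ["y", "o", "N"],
    "go": ["g", "o"],
    "roku": ["r", "o", "k", "u"],
    "nana": ["n", "a", "n", "a"],
    "hachi": ["h", "a", "ch", "i"],
    "kyuu": ["ky", "u", "u"],
    "zero": ["z", "e", "r", "o"],
}


def identity(phone):
    """Unclustered case: every distinct phone is its own context."""
    return phone


def cluster(phone):
    """Assumed clustering: two vowel classes and one consonant class.

    The split of vowels into {i, e} and {a, o, u} is a guess; the
    abstract does not say how the two vowel classes were formed.
    """
    if phone in {"i", "e"}:
        return "V1"
    if phone in {"a", "o", "u"}:
        return "V2"
    return "C"


def count_boundary_models(lexicon, context_map):
    """Count word-initial and word-final models over all boundary contexts.

    A word-initial model is distinguished by the (mapped) final phone of
    the preceding word; a word-final model by the (mapped) initial phone
    of the following word. "sil" stands for an utterance boundary.
    """
    left_contexts = {context_map(p[-1]) for p in lexicon.values()} | {"sil"}
    right_contexts = {context_map(p[0]) for p in lexicon.values()} | {"sil"}
    models = set()
    for word, phones in lexicon.items():
        for left in left_contexts:
            models.add((left, phones[0], word, "initial"))
        for right in right_contexts:
            models.add((word, phones[-1], "final", right))
    return len(models)


unclustered = count_boundary_models(LEXICON, identity)
clustered = count_boundary_models(LEXICON, cluster)
print(f"unclustered boundary models: {unclustered}")
print(f"clustered boundary models:   {clustered}")
```

In this sketch the saving comes entirely from the smaller set of distinct left and right contexts at word boundaries. Word-internal models are unaffected, which is why only boundary models are counted.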

Bibliographic reference.  Kondo, Kazuhiro / Kao, Yu-Hung / Wheatley, Barbara (1994): "On inter-phrase context dependencies in continuously read Japanese speech", In ICSLP-1994, 2179-2182.