4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
In this paper, we propose a precise but simple inter-word diphone model (IDM) for ward-spotting based on SMQ/HMM. We have applied ordinary diphone models to a speaker-independent, large-vocabulary word recognition unit. However, because users are apt to add words and/or extraneous speech, accuracy degrades due to the mismatch of models at word-boundaries. The IDM represents transition from the preceding phonemes to a word or from a word to the succeeding phonemes. An experiment showed that the IDMa reduce error rates by about 5% for speech containing unknown words and extraneous speech. The experiment also showed that the proposed method ensured performance good enough for the practical use of a large-vocabulary, isolated-word recognition system.
Bibliographic reference. Nitta, Tsuneo / Tanaka, Shin'ichi / Masai, Yasuyuki / Matsu'ura, Hiroshi (1996): "Word-spotting based on inter-word and intra-word diphone models", In ICSLP-1996, 1093-1096.