First European Conference on Speech Communication and Technology

Paris, France
September 27-29, 1989

Phoneme Recognition Expert System Using Spectrogram Reading Knowledge and Neural Networks

Yasuhiro Komori, Kaichiro Hatazaki, Takaharu Tanaka, Takeshi Kawabata, Kiyohiro Shikano

ATR Interpreting Telephony Research Laboratories, Seika-cho, Soraku-gun, Kyoto, Japan

We present a method for phoneme recognition using an expert system that combines spectrogram reading knowledge with neural networks, and we report its performance. The proposed expert system consists of two parts: (1) phoneme segmentation based on the spectrogram reading knowledge used by human experts, and (2) phoneme identification using neural networks applied to the phoneme boundaries determined during segmentation. Highly accurate phoneme segmentation can be achieved by using human-like contextual spectrogram reading knowledge. Moreover, high-performance phoneme identification can be achieved by applying neural networks to the accurate segmentation result. The system was tested on Japanese consonants, with 90.8% of the phonemes correctly segmented and 92.4% of the phonemes correctly identified within the correct segment. Overall, 83.9% of the phonemes were correctly recognized in both segmentation and identification.
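The three reported figures are mutually consistent if the 92.4% identification rate is read as conditional on a correct segment, as the wording "within the correct segment" suggests. A minimal sketch of that arithmetic follows; the function name is hypothetical and is not taken from the paper.

```python
def overall_recognition_rate(segmentation_rate: float,
                             identification_rate: float) -> float:
    """Fraction of phonemes that are correctly segmented AND correctly identified.

    Assumes identification_rate is measured only on correctly segmented
    phonemes, so the joint rate is the product of the two rates.
    """
    return segmentation_rate * identification_rate


# Figures reported in the abstract for Japanese consonants.
rate = overall_recognition_rate(0.908, 0.924)
print(f"{rate:.1%}")
```

The product, 0.908 × 0.924 ≈ 0.839, matches the reported overall recognition rate.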

Bibliographic reference. Komori, Yasuhiro / Hatazaki, Kaichiro / Tanaka, Takaharu / Kawabata, Takeshi / Shikano, Kiyohiro (1989): "Phoneme recognition expert system using spectrogram reading knowledge and neural networks", In EUROSPEECH-1989, 2549-2552.