4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Robust Speech Recognition with Speaker Localization by a Microphone Array

Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano

Graduate School of Information Science, Nara Institute of Science and Technology, Nara, Japan

This paper proposes robust speech recognition with Speaker Localization by an Arrayed Microphone (SLAM) to realize a hands-free speech interface in noisy environments. To localize the speaker direction accurately in low-SNR conditions, a speaker localization algorithm based on extracting pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition experiments are carried out both in computer simulation and in real environments. The results show that the proposed system attains much higher speech recognition performance than a single microphone, in both computer simulation and real environments.
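
The abstract does not give the details of the localization algorithm. As a rough illustration of why pitch harmonics can help in low-SNR conditions, the following minimal sketch scans candidate directions with a frequency-domain delay-and-sum beamformer and scores each direction only at the harmonic frequencies of the voice pitch. It is not the authors' method. The uniform linear array, the far-field model, the known pitch `f0`, and all function names and parameter values are assumptions made for this example.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def steering_delays(n_mics, spacing, angle_deg):
    """Far-field arrival delay (seconds) at each mic of a uniform linear array."""
    positions = np.arange(n_mics) * spacing
    return positions * np.sin(np.deg2rad(angle_deg)) / SPEED_OF_SOUND


def simulate(n_mics, spacing, angle_deg, f0, n_harm, fs, n_samples, snr_db, rng):
    """Synthetic harmonic source from one direction plus white noise (toy data)."""
    t = np.arange(n_samples) / fs
    delays = steering_delays(n_mics, spacing, angle_deg)
    x = np.zeros((n_mics, n_samples))
    for k in range(1, n_harm + 1):
        x += np.cos(2 * np.pi * k * f0 * (t[None, :] - delays[:, None]))
    noise_power = np.mean(x ** 2) / 10 ** (snr_db / 10)
    return x + rng.normal(scale=np.sqrt(noise_power), size=x.shape)


def localize(x, spacing, f0, n_harm, fs, angles):
    """Return the candidate angle with maximum beamformer power at the harmonics.

    Assumes the pitch f0 is already known; a real system would have to
    estimate it from the noisy input.
    """
    n_mics, n_samples = x.shape
    spectrum = np.fft.rfft(x, axis=1)
    freqs = np.fft.rfftfreq(n_samples, 1.0 / fs)
    # Keep only the FFT bins closest to each harmonic k * f0.
    bins = [int(np.argmin(np.abs(freqs - k * f0))) for k in range(1, n_harm + 1)]
    harm_spec = spectrum[:, bins]   # shape: (n_mics, n_harm)
    harm_freqs = freqs[bins]
    powers = []
    for angle in angles:
        delays = steering_delays(n_mics, spacing, angle)
        # Undo the propagation delay of each mic, then sum across mics.
        weights = np.exp(2j * np.pi * harm_freqs[None, :] * delays[:, None])
        beam = np.sum(weights * harm_spec, axis=0)
        powers.append(np.sum(np.abs(beam) ** 2))
    return float(angles[int(np.argmax(powers))])


rng = np.random.default_rng(0)
signals = simulate(n_mics=8, spacing=0.05, angle_deg=30.0, f0=150.0,
                   n_harm=15, fs=12000, n_samples=4800, snr_db=0.0, rng=rng)
estimate = localize(signals, spacing=0.05, f0=150.0, n_harm=15, fs=12000,
                    angles=np.arange(-90, 91, 5))
print("estimated direction (degrees):", estimate)
```

Restricting the score to harmonic bins discards most of the broadband noise energy, which is one plausible reason a harmonics-based criterion stays usable at low SNR. The 0.05 m spacing keeps the highest harmonic used here (2250 Hz) below the spatial aliasing limit of the assumed array.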


Bibliographic reference. Yamada, Takeshi / Nakamura, Satoshi / Shikano, Kiyohiro (1996): "Robust speech recognition with speaker localization by a microphone array", in ICSLP-1996, 1317-1320.