4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
This paper proposes robust speech recognition with Speaker Localization by a Arrayed Microphone (SLAM) to realize hands-free speech interface in noisy environments. In order to localize a speaker direction accurately in low SNR conditions, a speaker localization algorithm based on extracting a pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition experiments are carried out both in computer simulation and real environments. These results show that the proposed system attains the much higher speech recognition performance than that of a single microphone not only in computer simulation but also in real environments.
Bibliographic reference. Yamada, Takeshi / Nakamura, Satoshi / Shikano, Kiyohiro (1996): "Robust speech recognition with speaker localization by a microphone array", In ICSLP-1996, 1317-1320.