Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

System of Microphone Arrays and Neural Networks for Robust Speech Recognition in Multimedia Environments

Qiguang Lin (1), Ea-Ee Jan (1), Chi Wei Che (1), Bert de Vries (2)

(1) CAIP Center, Rutgers University, Piscataway, NJ, USA
(2) David Sarnoff Research Center, Princeton, NJ, USA

Hands-free operation of speech processing systems is sometimes desired to avoid encumbering the user with tethered microphone equipment. This paper explores the use of microphone arrays and neural networks (MANN) for robust speech recognition in real-world environments, such as large-group conferencing. Microphone arrays (MA) provide high-quality, hands-free sound pickup under severe acoustical conditions, and neural network (NN) processors "learn" the characteristics of environmental interference and transform features of the MA-enhanced signal to those obtained under close-talking conditions. In this study, both speech collected in a real room and computer-simulated reverberant speech are used to evaluate the power and advantages of MANN for direct deployment of speech recognition technology in adverse practical environments.

Bibliographic reference. Lin, Qiguang / Jan, Ea-Ee / Che, Chi Wei / Vries, Bert de (1994): "System of microphone arrays and neural networks for robust speech recognition in multimedia environments", in ICSLP-1994, 1247-1250.