Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Scalable and Portable Web-Based Multimodal Dialogue Interaction with Geographical Databases

Alexander Gruenstein, Stephanie Seneff, Chao Wang

Massachusetts Institute of Technology, USA

We describe work towards developing a scalable and portable framework for enabling map-based multimodal dialogue interaction over the web. Working in the context of a restaurant-guide system, we show how large information databases harvested from the web can be accommodated in our speech recognizer, parser, and web-based GUI. We compare two dynamic language modeling techniques, which calculate context-dependent weights for the large sets of proper nouns associated with geographical entities such as restaurants and streets. We show that the more fine-grained approach results in a 7.8% reduction in concept error rate.

Full Paper

Bibliographic reference.  Gruenstein, Alexander / Seneff, Stephanie / Wang, Chao (2006): "Scalable and portable web-based multimodal dialogue interaction with geographical databases", In INTERSPEECH-2006, paper 1095-Mon2FoP.2.