Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Generating Time-Constrained Audio Presentations of Structured Information

Brian Langner, Rohit Kumar, Arthur Chan, Lingyun Gu, Alan W. Black

Carnegie Mellon University, USA

Presenting complex information in an understandable manner using speech is a challenging task to do well. Significant limitations, both in the generation process and from the human listeners’ capabilities, typically make for poorly understood speech. This work examines possible strategies for producing understandable spoken complex information working within those limitations, as well as identifying ways to improve systems to reduce the limitations’ impact. We discuss a simple user study that explores these strategies with complex structured information, and describe a spoken dialog system that will make use of this work to provide a speech interface to structured information in a more understandable manner.

Full Paper

Bibliographic reference.  Langner, Brian / Kumar, Rohit / Chan, Arthur / Gu, Lingyun / Black, Alan W. (2006): "Generating time-constrained audio presentations of structured information", In INTERSPEECH-2006, paper 2075-Thu2A3O.6.