Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Saliency Parsing for Automated Directory Assistance

Issac Alphonso, Shuangyu Chang

Tellme Networks Inc., USA

In a statistical language model based automated directory assistance system, extracting the salient information from the recognition output can significantly increase the accuracy of the backend listing database search. In this paper, we describe a Hidden Markov model (HMM) based saliency parser that was developed to accurately and efficiently identify salient words from the recognition output by modeling both the syntactic structure as well as the lexical distribution. The parser can be trained using a relatively small data set with coarse syntactic class labels, without the need for detailed syntactic knowledge or a treebank-like corpus. Experimental results on a research corpus of directory assistance utterances betoken the parserís importance within the automated system. The results demonstrate that the proposed saliency parser can significantly improve the overall automation rate without increasing the error rate.

Full Paper

Bibliographic reference.  Alphonso, Issac / Chang, Shuangyu (2006): "Saliency parsing for automated directory assistance", In INTERSPEECH-2006, paper 1421-Mon2WeO.1.