13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity

Huny-yi Lee (1), Po-wei Chou (2), Lin-shan Lee (1)

(1) Graduate Institute of Communication Engineering; (2) Department of Electrical Engineering;
National Taiwan University, Taiwan

Acoustic feature similarity between utterances has been shown to be very helpful for spoken term detection using pseudo-relevance feedback (PRF) and graph-based re-ranking. Both cases are based on the concept that utterances similar to those utterances with higher relevance scores in acoustic features should have higher scores, while graph-based re-ranking further considers the similarity structure between many utterances globally with a graph. In this paper, we extend these approaches to consider acoustic feature similarity between utterances over both word and subword lattices, and offer a complete formulation for the general problem of open vocabulary retrieval of spoken content with shorter or longer queries. All these are verified by significant improvements in preliminary experiments with both in vocabulary (IV) and OOV queries.

Index Terms: Spoken Content Retrieval, Pseudo-relevance Feedback, Random Walk

Full Paper

Bibliographic reference.  Lee, Huny-yi / Chou, Po-wei / Lee, Lin-shan (2012): "Open-vocabulary retrieval of spoken content with shorter/longer queries considering word/subword-based acoustic feature similarity", In INTERSPEECH-2012, 2077-2080.