13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Lexical-Phonetic Automata for Spoken Utterance Indexing and Retrieval

Julien Fayolle (1), Murat Saraçlar (2), Fabienne Moreau (1), Christian Raymond (1), Guillaume Gravier (1)

(1) IRISA (INRIA, University of Rennes 2, INSA, CNRS), Rennes, France
(2) Department of Electrical and Electronic Engineering, Boğaziçi University, Istanbul, Turkey

This paper presents a method for indexing spoken utterances that combines lexical and phonetic hypotheses in a hybrid index built from automata. Retrieval relies on a lexical-phonetic, semi-imperfect matching that aims to improve recall. Each transition is weighted by a feature vector, containing edit distance scores and a confidence measure, which helps filter the list of candidate utterances for a more precise search. Experimental results show the complementarity of the lexical and phonetic representations and compare the hybrid search with the state-of-the-art cascaded search for retrieving named-entity queries.
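
To illustrate the kind of feature vector the abstract mentions, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not say which edit distances are used or how they are normalised, so the unit-cost Levenshtein distance, the length normalisation, and the names `edit_distance`, `Features`, and `match_features` are all assumptions made for illustration.

```python
from dataclasses import dataclass


def edit_distance(a, b):
    """Unit-cost Levenshtein distance between two sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,               # deletion
                            curr[j - 1] + 1,           # insertion
                            prev[j - 1] + (x != y)))   # substitution or match
        prev = curr
    return prev[-1]


@dataclass
class Features:
    """Hypothetical feature vector attached to one matched transition."""
    lexical_distance: float   # normalised distance on characters (assumed)
    phonetic_distance: float  # normalised distance on phone symbols (assumed)
    confidence: float         # confidence measure supplied by the recogniser


def match_features(query_word, query_phones, hyp_word, hyp_phones, confidence):
    """Build the feature vector for a query term against one hypothesis."""
    lex = edit_distance(query_word, hyp_word) / max(len(query_word), len(hyp_word), 1)
    pho = edit_distance(query_phones, hyp_phones) / max(len(query_phones), len(hyp_phones), 1)
    return Features(lex, pho, confidence)


# Example: the spelling differs but the phone sequence is identical.
feats = match_features("rennes", ["r", "e", "n"], "renne", ["r", "e", "n"], 0.8)
print(feats)
```

In this example the phonetic distance is zero while the lexical distance is not, which is the sort of case where a phonetic representation could recover a match that a purely lexical one would miss. A supervised classifier could then take such vectors as input to accept or reject candidate utterances.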

Index Terms: information retrieval, speech indexing, lexical-phonetic automata, confidence measures, edit distances, supervised learning


Bibliographic reference. Fayolle, Julien / Saraçlar, Murat / Moreau, Fabienne / Raymond, Christian / Gravier, Guillaume (2012): "Lexical-phonetic automata for spoken utterance indexing and retrieval", In INTERSPEECH-2012, 2470-2473.