12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Image Processing Filters for Line Detection-Based Spoken Term Detection

Kazuyuki Noritake, Hiroaki Nanjo, Takehiko Yoshimi

Ryukoku University, Japan

Spoken term detection (STD) from oral presentations is addressed. Specifically, we regard STD as a line detection problem in an image file, in which each pixel holds a syllable-distance between query term and automatic speech recognition (ASR) results. Since such kind of image file essentially includes ASR errors, line detection in noisy image should be investigated. In this paper, we propose line detection-oriented image processing filters for STD. We achieved 0.39 of F-measure for low frequency term (out of vocabulary term in ASR system) detection task, and 0.69 of F-measure for known term (in-vocabulary term in ASR system) detection task.

Full Paper

Bibliographic reference.  Noritake, Kazuyuki / Nanjo, Hiroaki / Yoshimi, Takehiko (2011): "Image processing filters for line detection-based spoken term detection", In INTERSPEECH-2011, 2125-2128.