13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Automatic Phoneme Segmentation Using Auditory Attention Features

Ozlem Kalinli

Sony Computer Entertainment US R&D, Foster City, CA, USA

Segmentation of speech into phonemes is beneficial for many spoken language processing applications. Here, a novel method which uses auditory attention features for detecting phoneme boundaries from acoustic signal is proposed. The proposed phoneme segmentation method does not require transcription or acoustic models of phonemes. The auditory attention cues are biologically inspired and capture changes in sound characteristics by using 2D spectro-temporal receptive filters. When tested on TIMIT, it is shown that the proposed method successfully predicts phoneme boundaries and performs better than the state-of-the art phoneme segmentation methods.

Index Terms: speech segmentation, phoneme boundary detection, auditory attention model.

Full Paper

Bibliographic reference.  Kalinli, Ozlem (2012): "Automatic phoneme segmentation using auditory attention features", In INTERSPEECH-2012, 2270-2273.