13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Robust Phoneme Recognition Based on Biomimetic Speech Contours

Michael A. Carlin, Kailash Patil, Sridhar Krishna Nemala, Mounya Elhilali

Dept. of Electrical and Computer Engineering & Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, USA

It has previously been suggested that ensembles of central auditory neurons optimize a sustained firing criterion as part of the underlying code for representing sound. Moreover, computational studies have shown that optimizing such a criterion yields ensembles of spectro-temporal receptive fields akin to those observed in physiological studies. In this study, we show that these emergent receptive fields contour the high-energy modulations in speech. A simple 2D filter derived in this way is shown to improve on the performance of state-of-the-art phoneme recognition systems by 6.2% absolute on average under both additive noise and reverberation.

Index Terms: robust feature extraction, bio-inspired features, sustained neural firings

Bibliographic reference. Carlin, Michael A. / Patil, Kailash / Nemala, Sridhar Krishna / Elhilali, Mounya (2012): "Robust phoneme recognition based on biomimetic speech contours", in INTERSPEECH-2012, 1348-1351.