13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Maximising Objective Speech Intelligibility by Local F0 Modulation

Julián Villegas (1,2), Martin Cooke (1,2)

(1) Ikerbasque (Basque Foundation for Science), Spain
(2) Language and Speech Laboratory, University of the Basque Country, Spain

We investigated the effect on objective speech intelligibility of scaling the fundamental frequency (F0) of voiced regions in a set of utterances. The frequency scaling was driven by maximising the glimpse proportion in voiced epochs, an approach inspired by musical consonance maximisation techniques. Results show that, depending on the energetic masker and the signal-to-noise ratio, F0 modifications increased the mean glimpse proportion by up to 15%. On average, lower mean F0 changes resulted in greater glimpse proportions. It was also found that the glimpse proportion could be a good predictor of musical consonance.
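As a rough illustration of the optimisation criterion, the sketch below computes a glimpse proportion as the fraction of time-frequency cells in which the speech level exceeds the masker level by some local threshold, and then picks the candidate F0 scaling factor with the highest value. This is not the authors' implementation: the function names, the 3 dB default threshold, and the assumption that speech and masker are already available as matching time-frequency level matrices in dB are all hypothetical choices made for the example. The abstract does not specify the auditory front end, the threshold, or the search procedure.

```python
import numpy as np


def glimpse_proportion(speech_db, masker_db, threshold_db=3.0):
    """Fraction of time-frequency cells where speech exceeds the masker.

    speech_db, masker_db: arrays of the same shape holding levels in dB
    (hypothetical input format). threshold_db is an assumed local SNR
    criterion, not a value taken from the abstract.
    """
    speech_db = np.asarray(speech_db, dtype=float)
    masker_db = np.asarray(masker_db, dtype=float)
    if speech_db.shape != masker_db.shape:
        raise ValueError("speech and masker must have the same shape")
    # A cell counts as "glimpsed" when speech is above masker + threshold.
    glimpsed = speech_db > masker_db + threshold_db
    return float(glimpsed.mean())


def best_scaling(candidates, masker_db, threshold_db=3.0):
    """Return the scaling factor whose speech representation scores highest.

    candidates: dict mapping a hypothetical F0 scaling factor to the
    time-frequency representation of the speech resynthesised with it.
    """
    return max(
        candidates,
        key=lambda factor: glimpse_proportion(
            candidates[factor], masker_db, threshold_db
        ),
    )
```

In this sketch the resynthesis of the scaled speech is assumed to happen elsewhere. Only the scoring and the selection among precomputed candidates are shown.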

Index Terms: roughness, glimpse proportion, objective speech intelligibility, musical consonance, fundamental frequency

Bibliographic reference. Villegas, Julián / Cooke, Martin (2012): "Maximising objective speech intelligibility by local F0 modulation", in INTERSPEECH-2012, 1704-1707.