We investigated the effect on objective speech intelligibility of scaling the fundamental frequency (F0) of voiced regions in a set of utterances. The frequency scaling was driven by max- imising the glimpse proportion in voiced epochs, inspired by musical consonance maximisation techniques. Results show that depending on the energetic masker and the signal to noise ratio, F0 modifications increased the mean glimpse proportion by up to 15 %. On average, lower mean F0 changes resulted in greater glimpse proportions. It was also found that the glimpse proportion could be a good predictor of music consonance.
Index Terms: roughness, glimpse proportion, objective speech intelligibility, musical consonance, fundamental frequency
Bibliographic reference. Villegas, Julián / Cooke, Martin (2012): "Maximising objective speech intelligibility by local F0 modulation", In INTERSPEECH-2012, 1704-1707.