Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Further Investigations on the Relationship Between Objective Measures of Speech Quality and Speech Recognition Rates in Noisy Environments

Francisco José Fraga (1), Carlos Alberto Ynoguti (2), André Godoi Chiovato (2)

(1) Federal University of ABC, Brazil; (2) National Institute of Telecommunications, Brazil

The relationship between an objective measure of speech quality (PESQ) and the recognition rate of a given speech recognition system was already investigated by other researchers. In this paper, we present a further investigation on such a relationship. In our research, the speech recognition tests were performed on a wider class of signals and SNR. The experimental setup as well as the speech recognition systems now evaluated were based on the directions given by the Aurora project. Moreover, a new parametric modeling approach for the PESQ-MOS versus speech recognition rate curve, based on the logistic function, is proposed. This new modeling allows some meaningful interpretations of the parameters of the logistic function in terms of system robustness, and permits to make inferences in the regions outside the experimental measures. Furthermore, the PESQ versus SNR characteristic was used to group types of noise, leading to a much better fit of the logistic function over the data points.

Full Paper

Bibliographic reference.  Fraga, Francisco José / Ynoguti, Carlos Alberto / Chiovato, André Godoi (2006): "Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments", In INTERSPEECH-2006, paper 1877-Mon1CaP.9.