Third International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2003)
This paper proposes a new perceptually-based method for assessing speech quality and evaluates its performance. The method is based on comparing the received speech to an appropriate reference representing the closest match from a preformulated codebook. The codebook holds a number of optimally clustered speech parameter vectors extracted from a large number of various undistorted clean speech records. The objective auditory distances between vectors of the distorted speech signal and their corresponding matching references are then measured and appropriately converted into an equivalent subjective score. The optimal clustering of the reference codebook is achieved by using a dynamic k-means method. Efficient data mining technique known as Self-Organising Map is used to match the distorted speech vectors to the references. Speech parameters derived from Bark spectrum analysis, and Mel-Frequency Cepstral coefficients (MFCC) are used to provide speaker independent parametric representation of the speech signals as required by an output-based quality measure.
Index Terms. Speech Processing, Perceptually-Based Speech Quality, Perceptual Quality Measure.
Full Paper (reprinted with permission from Firenze University Press)
Bibliographic reference. Picovici, D. / Mahdi, A. E. (2003): "Perceptually-based objective measure for non-intrusive speech quality assessment", In MAVEBA-2003, 169-172.