Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Reducing Speech Coding Distortion for Speaker Identification

Alan McCree

Massachusetts Institute of Technology, USA

In this paper, we investigate how speech coding algorithms used in digital telephone networks, cellular telephony, and voice over IP degrade speaker identification performance. We quantify this degradation by measuring the spectral distortion between front-end feature vectors derived from coded and uncoded speech. This analysis leads to two novel methods for distortion compensation: codebook compensation and LPC compensation. Both are shown to significantly reduce front-end mismatch, with LPC compensation providing the more encouraging results. Full experiments using a GMM-UBM speaker ID system confirm the usefulness of both the front-end distortion analysis and the LPC compensation technique.
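The abstract does not state the distortion formula used in the paper. As an illustration only, the sketch below uses one common textbook definition: the RMS log-spectral distortion in dB computed from the difference between two cepstral vectors via Parseval's relation. It assumes each frame is a list of cepstral coefficients c_1..c_K of the log power spectrum, with the gain term c_0 excluded. The function names and the frame layout are hypothetical and are not taken from the paper.

```python
import math

# Converts a natural-log power-spectrum difference to decibels.
DB_PER_NEPER = 10.0 / math.log(10.0)


def cepstral_distortion_db(c_ref, c_test):
    """RMS log-spectral distortion (dB) between two cepstral vectors.

    Assumption: both vectors hold c_1..c_K of the log power spectrum
    (c_0 excluded), so the mean squared log-spectral difference equals
    2 * sum((c_ref[k] - c_test[k]) ** 2) over the retained coefficients.
    """
    if len(c_ref) != len(c_test):
        raise ValueError("cepstral vectors must have the same length")
    squared_diff = sum((a - b) ** 2 for a, b in zip(c_ref, c_test))
    return DB_PER_NEPER * math.sqrt(2.0 * squared_diff)


def mean_distortion_db(ref_frames, test_frames):
    """Average per-frame distortion over two time-aligned frame sequences."""
    if len(ref_frames) != len(test_frames) or not ref_frames:
        raise ValueError("need two non-empty sequences of equal length")
    per_frame = [
        cepstral_distortion_db(r, t) for r, t in zip(ref_frames, test_frames)
    ]
    return sum(per_frame) / len(per_frame)
```

Under these assumptions, `mean_distortion_db(uncoded_frames, coded_frames)` gives a single front-end mismatch figure for one coder, and the same call on compensated features would show whether a compensation step reduces it. Features built on a mel-warped spectrum would need a correspondingly adjusted interpretation of the result.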


Bibliographic reference.  McCree, Alan (2006): "Reducing speech coding distortion for speaker identification", In INTERSPEECH-2006, paper 1989-Tue1CaP.10.