Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

Speech Coding Based on Adaptive MEL-Cepstral Analysis for Noisy Channels

Kazuhito Koishida (1), Keiichi Tokuda (2), Takao Kobayashi (1), Satoshi Imai (1)

(1) Precision and Intelligence Laboratory, Tokyo Institute of Technology, Yokohama, Japan
(2) Department of Electrical and Electronic Engineering, Tokyo Institute of Technology, Tokyo, Japan

In this paper, we examine the robustness of an ADPCM coder based on adaptive mel-cepstral analysis. To improve the noisy channel performance, we use traditional techniques: the leakage factor and sign function. The subjective speech quality of the proposed 16kb/s coder and G.726 coder in terms of the opinion equivalent Q is measured and compared. It is shown that the proposed coder produces much higher quality speech than that of 16kb/s G.726 at BER(Bit Error Rate)=0 and BER=10~3. Although the coder scores 4dB lower than 32kb/s G.726 at BER=0, the improvement of more than 5dB is achieved by the proposed coder over G.726 at BER=10~3.

Full Paper

Bibliographic reference.  Koishida, Kazuhito / Tokuda, Keiichi / Kobayashi, Takao / Imai, Satoshi (1994): "Speech coding based on adaptive MEL-cepstral analysis for noisy channels", In ICSLP-1994, 2087-2090.