5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

CDHMM Speaker Recognition by Means of Frequency Filtering of Filter-Bank Energies

Javier Hernando, Climent Nadeu

Universitat Politecnica de Catalunya, Barcelona, Spain

Recently, the set of spectral parameters of every speech frame that result from filtering the frequency sequence of mel-scaled filter-bank energies with a simple first-order high-pass FIR filter have proved to be an efficient speech representation in terms of both speech recognition rate and computational load. In this paper, we apply the same technique to speaker recognition. Frequency filtering approximately equalizes the cepstrum variance, enhancing the oscillations of the spectral envelope curve that are most effective for discriminating between speakers. In this way, even better speaker identification results than using conventional mel-cepstrum were observed in continuous observation Gaussian density HMM, especially in noisy conditions.

Full Paper

Bibliographic reference.  Hernando, Javier / Nadeu, Climent (1997): "CDHMM speaker recognition by means of frequency filtering of filter-bank energies", In EUROSPEECH-1997, 2363-2366.