Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Development and Improvement of a Real-Time ASR System for Isolated Digits in Spanish over the Telephone Line

Ricardo de Cordoba, Xavier Menendez-Pidal, Javier Macias-Guarasa, Ascension Gallardo, José M. Pardo

Grupo de Tecnologia del Habla, Dpto. Ingenieria Electronica, ETSIT Madrid, UPM, Ciudad Universitaria s/n, Madrid 28040, Spain

We present the development and characteristics of a basic ASR system for isolated digits in Spanish, used over the telephone line. We first introduce our initial design, a basic discrete system, and then describe the improvements we made to increase the recognition rate at a low CPU cost, always bearing in mind its practical implementation as a real-time system. The most notable gains were obtained with: 1) Semicontinuous modelling, which is more precise but more time-consuming. 2) End-pointing with a neural network. 3) One-pass decoding with noise models. Both 2 and 3 are intended to alleviate the effects of incorrect end-pointing. 4) Parametrization using perceptual filters in frequency and filtering in the time domain (RASTA-PLP), intended to reduce the effect of telephone noise on the system.

Bibliographic reference. Cordoba, Ricardo de / Menendez-Pidal, Xavier / Macias-Guarasa, Javier / Gallardo, Ascension / Pardo, José M. (1995): "Development and improvement of a real-time ASR system for isolated digits in Spanish over the telephone line", In EUROSPEECH-1995, 1537-1540.