4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Robust F0 and Jitter Estimation in Pathological Voices

Maurílio N. Vieira, Fergus R. McInnes, Mervyn A. Jack

CCER, Dept. of Electrical Engineering, University of Edinburgh, Edinburgh, Scotland, UK

Dysphonic voices were used to compare electroglottographic (EGG) and acoustic measures of fundamental frequency (F0) and jitter, using a waveform-matching technique and an event-based technique. The first part of the study considered continuous speech and analysed the effects of pre-filtering the acoustic signals and linearly smoothing the F0 contours. The second part compared jitter from sustained vowels (/i/, /a/, /u/), and found poor agreement for /i/ and /u/. In /a/ vowels, however, a relatively small mean normalised absolute difference (10.95%) was obtained with a proposed method that combines peak-picking and zero crossings; this method can detect a waveform pattern observed in such vowels and reject unreliable measures.
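
To make the two quantities in the abstract concrete, the following is a minimal sketch of how jitter and a normalised absolute difference between an acoustic and an EGG estimate might be computed. The abstract does not give the paper's formulas, so both definitions are assumptions: jitter is taken here as the common "local" form (mean absolute difference between consecutive glottal periods, relative to the mean period), and the normalised absolute difference is taken relative to the EGG value. The function names are hypothetical and the period values in the usage note are illustrative, not data from the paper.

```python
def local_jitter_percent(periods):
    """Local jitter in percent (assumed definition, not taken from the paper):
    mean absolute difference between consecutive periods, divided by the
    mean period."""
    if len(periods) < 2:
        raise ValueError("need at least two periods")
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return 100.0 * mean_diff / mean_period


def normalised_abs_difference_percent(acoustic, egg):
    """Absolute difference between the acoustic and EGG measures, in percent
    of the EGG measure (assumed normalisation, not taken from the paper)."""
    if egg == 0:
        raise ValueError("EGG reference value must be non-zero")
    return 100.0 * abs(acoustic - egg) / egg
```

For example, with hypothetical periods of 10.0, 10.2, 9.8 and 10.0 ms, `local_jitter_percent` gives roughly 2.67%. Averaging `normalised_abs_difference_percent` over a set of voices would give one possible reading of a "mean normalised absolute difference".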


Bibliographic reference.  Vieira, Maurílio N. / McInnes, Fergus R. / Jack, Mervyn A. (1996): "Robust F0 and jitter estimation in pathological voices", In ICSLP-1996, 745-748.