4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

A Stochastic Model of Fundamental Period Perturbation and Its Application to Perception of Pathological Voice Quality

Yasuo Endo, Hideki Kasuya

Faculty of Engineering, Utsunomiya University, Utsunomiya, Japan

This paper proposes a stochastic model of fundamental period perturbation and applies it to the perception of pathological voice quality. A fundamental period perturbation is generated by a second order auto-regressive moving average (ARMA) model which includes excitation by a random white noise. Standard deviation, pole frequency and pole bandwidth were used as parameters of the model. Sustained vowels were synthesized by systematically manipulating the model parameters and subjected to the perceptual experiment to understand roles of the parameters in the perceived quality of pathological voice. Five subjects participated in the experiment and were asked to judge whether a vowel sample was normal or pathological. It was found that perceptual impression of the pathological voice was associated not only with the magnitude but also the pole frequency and bandwidth of the fundamental period perturbation.

Full Paper

Bibliographic reference.  Endo, Yasuo / Kasuya, Hideki (1996): "A stochastic model of fundamental period perturbation and its application to perception of pathological voice quality", In ICSLP-1996, 772-775.