Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2009)

Florence, Italy
December 14-16, 2009

A Bottom-Up Procedure to Extract Periodicity Structure of Voiced Sounds and its Application to Represent and Restoration of Pathological Voices

Hanae Itagaki (1), Masanori Morise (2), Ryuichi Nisimura (1), Toshio Irino (1), Hideki Kawahara (1)

(1) Wakayama University, Wakayama, Japan; (2) Ritsumeikan University, Japan

A bottom-up procedure for extracting repetitive structures in speech sounds has been developed on the basis of a temporally stable representation of periodic sounds (TANDEM) and adaptive spectral smoothing (STRAIGHT). The proposed method evaluates local periodic structures in the frequency domain to detect repetition in the time domain. A group of dedicated periodicity detectors is combined to construct a repetitive structure extractor called the excitation structure extractor (XSX). The proposed procedure was tested using a set of stylized test signals with artificial shimmer and jitter to investigate its applicability to such aperiodic signals. The test results indicated that the proposed procedure outperformed existing F0 detectors in descriptive power for those complex excitation modes. Finally, the proposed procedure was applied to pathological voice examples to investigate the feasibility of voice quality restoration applications.
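The abstract does not specify how the stylized test signals were built. As a purely illustrative sketch, the Python below generates a pulse train whose periods are randomly perturbed (jitter) and whose pulse amplitudes are randomly perturbed (shimmer). The function name, the uniform perturbation model, and all parameter values are assumptions made for this example and are not taken from the paper.

```python
import random


def jittered_pulse_train(f0_hz=100.0, fs_hz=8000, duration_s=0.5,
                         jitter=0.02, shimmer=0.1, seed=0):
    """Return (signal, pulse_positions) for a perturbed pulse train.

    jitter  : maximum relative deviation of each period (assumed uniform)
    shimmer : maximum relative deviation of each pulse amplitude (assumed uniform)
    """
    rng = random.Random(seed)
    n_samples = int(fs_hz * duration_s)
    signal = [0.0] * n_samples
    positions = []
    nominal_period = fs_hz / f0_hz  # nominal period in samples
    t = 0.0                         # continuous pulse time in samples
    while True:
        idx = int(round(t))
        if idx >= n_samples:
            break
        # Shimmer: scale this pulse's amplitude around 1.0.
        signal[idx] += 1.0 + shimmer * rng.uniform(-1.0, 1.0)
        positions.append(idx)
        # Jitter: stretch or shrink the next period around its nominal length.
        t += nominal_period * (1.0 + jitter * rng.uniform(-1.0, 1.0))
    return signal, positions


if __name__ == "__main__":
    sig, pos = jittered_pulse_train()
    intervals = [b - a for a, b in zip(pos, pos[1:])]
    print(len(pos), min(intervals), max(intervals))
```

Setting `jitter=0.0` and `shimmer=0.0` yields a strictly periodic reference signal, which is a convenient baseline when comparing how a periodicity detector responds to increasing perturbation.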

Index Terms. periodicity extraction, fundamental frequency, TANDEM-STRAIGHT, XSX, aperiodicity, pathological voice

Full Paper (reprinted with permission from Firenze University Press)

Bibliographic reference. Itagaki, Hanae / Morise, Masanori / Nisimura, Ryuichi / Irino, Toshio / Kawahara, Hideki (2009): "A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices", In MAVEBA-2009, 115-118.