4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Detection of Phrase Boundaries in Japanese by Low-Pass Filtering of Fundamental Frequency Contours

Atsuhiro Sakurai, Keikichi Hirose

Dept. of Information and Communication Engineering, University of Tokyo, Japan

Major syntactic boundaries are often accompanied by a rise in the phrase component of the fundamental frequency (F0) contour, so detecting such rises can substantially aid the speech recognition process. We developed a method to detect syntactic boundaries marked by a phrase-component rise (henceforth, phrase boundaries) by compressing the accent component of the F0 contour (on a logarithmic scale) with a low-pass filter. In this method, F0 contours are treated as time-domain signals that can be roughly separated into phrase and accent components because the two have different frequency content (the concepts of phrase and accent components are described in [1]). A phrase boundary is detected whenever a significant rise occurs in the derivative of the filtered F0 contour. The method detected about 77% of manually detectable phrase boundaries, though with a relatively high insertion rate. The insertion rate can be reduced by using the partial AbS method proposed by the authors [7].
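
The processing chain summarized above (log F0, low-pass filtering, derivative, rise detection) can be illustrated with a minimal sketch. It is not the filter or decision rule used in this work: a centered moving average stands in for the low-pass filter, and the function names, the window width, the threshold, and the 10 ms frame period are all hypothetical values chosen for illustration. The sketch also assumes a continuous F0 track in Hz, with unvoiced frames already interpolated.

```python
import math


def moving_average(x, width):
    """Crude low-pass filter (an assumption, not the paper's filter):
    centered moving average, with the window shrunk at the edges."""
    half = width // 2
    out = []
    for i in range(len(x)):
        lo = max(0, i - half)
        hi = min(len(x), i + half + 1)
        out.append(sum(x[lo:hi]) / (hi - lo))
    return out


def detect_phrase_boundaries(f0_hz, frame_s=0.01, width=41, threshold=0.2):
    """Return candidate phrase-boundary times in seconds.

    f0_hz     : F0 values in Hz, one per frame, all positive (assumed)
    frame_s   : frame period in seconds (hypothetical default)
    width     : moving-average length in frames (hypothetical default)
    threshold : minimum slope of the smoothed log F0, in log units
                per second, that counts as a rise (hypothetical default)
    """
    # Work on a logarithmic scale, as the abstract specifies.
    log_f0 = [math.log(f) for f in f0_hz]

    # Low-pass filtering smooths out the faster accent movements and
    # leaves the slower phrase-level trend.
    smooth = moving_average(log_f0, width)

    # First difference approximates the time derivative.
    deriv = [(smooth[i + 1] - smooth[i]) / frame_s
             for i in range(len(smooth) - 1)]

    # Report the midpoint of every run of frames whose slope is at or
    # above the threshold. The -inf sentinel closes a trailing run.
    boundaries = []
    start = None
    for i, d in enumerate(deriv + [float("-inf")]):
        if d >= threshold and start is None:
            start = i
        elif d < threshold and start is not None:
            boundaries.append(0.5 * (start + i - 1) * frame_s)
            start = None
    return boundaries
```

Calling detect_phrase_boundaries on a per-frame F0 list returns the times at which the smoothed contour rises. In this sketch a lower threshold yields more detections and also more insertions, and a wider window suppresses more of the accent movement at the cost of time resolution.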

