We proposed an automatic method for determining the boundaries of prosodic phrases in real speech waves. In this method, the dynamic programming (DP) and the least mean square error (LMSE) methods were implemented based on the F0 generation model. In order to evaluate the accuracy and validity of this proposed method, a set of 973 standard Chinese speech sentences was selected. The cumulative proportion of the estimated prosodic phrase boundaries approached 76% when ET(0i) was less than the average duration of the prosodic phrases. Thus, it can be concluded that this proposed method can be used in the practical application.
Bibliographic reference. Bu, Shehui / Zhuo, Zhenjie / Yang, Lingling / Itahashi, Shuichi (2011): "Automatic determination of the standard Chinese prosodic phrase boundaries by f_0 generation model", In INTERSPEECH-2011, 1389-1392.