Speech Prosody 2008

Campinas, Brazil
May 6-9, 2008

Phoneme Dedicated ANN Improves Segmental Duration Model

João Paulo Teixeira (1), Diamantino Freitas (2)

(1) ESTiG - Instituto Politécnico de Bragança, Portugal
(2) Faculdade de Engenharia da Universidade do Porto, Portugal

The Phoneme Dedicated Artificial Neural Network (PDANN) segmental duration model consists of a set of ANNs trained specifically for each phoneme segment in order to avoid miscellaneous influence of different types of phoneme segments. Therefore, each ANN is dedicated to predict the duration of a specific phoneme segment. Objective and subjective measurements of the performance of the PDANN model were compared with those of a typical ANN model using the same input features and database. The results indicate a slight, but clear, perceptually perceived preference towards the PDANN.

Full Paper

Bibliographic reference.  Teixeira, João Paulo / Freitas, Diamantino (2008): "Phoneme dedicated ANN improves segmental duration model", In SP-2008, 371-374.