Speech Prosody 2008
The Phoneme Dedicated Artificial Neural Network (PDANN) segmental duration model consists of a set of ANNs trained specifically for each phoneme segment in order to avoid miscellaneous influence of different types of phoneme segments. Therefore, each ANN is dedicated to predict the duration of a specific phoneme segment. Objective and subjective measurements of the performance of the PDANN model were compared with those of a typical ANN model using the same input features and database. The results indicate a slight, but clear, perceptually perceived preference towards the PDANN.
Bibliographic reference. Teixeira, João Paulo / Freitas, Diamantino (2008): "Phoneme dedicated ANN improves segmental duration model", In SP-2008, 371-374.