EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Synthesizing Intonation of Standard Arabic Language

A. Zaki (1), A. Rajouani (2), M. Najim (1)

(1) Equipe Signal et Image, ENSEIRB, France
(2) Laboratoire d’Electronique et Etudes des Systèmes Automatiques, Rabat, Morocco

In this paper, we propose a model that generates fundamental frequency (F0) contours using neural networks. A learning procedure is proposed as an alternative to synthesis by rule. Generating a correct F0 contour is one of the key factors in the naturalness of automatic text-to-speech conversion systems. The proposed approach is based on a standard feed-forward multi-layer network that produces global F0 contours of sentences directly from encoded linguistic features of Standard Arabic. Our model does not need syntactic information to produce suitable declarative intonation. A TD-PSOLA synthesizer is used to validate our results.
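
For illustration only, the mapping from encoded linguistic features to F0 values might be sketched as a small feed-forward network, as below. The abstract does not specify the feature encoding, layer sizes, activation function, or training algorithm, so everything here is an assumption: 8 hypothetical features per unit, 16 tanh hidden units, one normalized F0 value per unit, plain gradient descent, and synthetic data in place of a real Arabic corpus. The names `init_mlp`, `forward`, and `train_step` are hypothetical.

```python
import numpy as np

def init_mlp(n_in, n_hidden, n_out, seed=0):
    """Create weights for a one-hidden-layer feed-forward network (sizes are assumptions)."""
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    """Map encoded feature vectors x (batch, n_in) to predicted F0 values (batch, n_out)."""
    h = np.tanh(x @ params["W1"] + params["b1"])  # hidden layer
    return h @ params["W2"] + params["b2"], h     # linear output layer

def train_step(params, x, y, lr=0.05):
    """One gradient-descent step on a squared-error loss; returns the MSE before the update."""
    pred, h = forward(params, x)
    err = pred - y
    n = x.shape[0]
    # Backpropagate the error through the linear output and tanh hidden layer.
    grad_W2 = h.T @ err / n
    grad_b2 = err.mean(axis=0)
    dh = (err @ params["W2"].T) * (1.0 - h ** 2)
    grad_W1 = x.T @ dh / n
    grad_b1 = dh.mean(axis=0)
    params["W1"] -= lr * grad_W1
    params["b1"] -= lr * grad_b1
    params["W2"] -= lr * grad_W2
    params["b2"] -= lr * grad_b2
    return float((err ** 2).mean())

# Synthetic stand-in data (NOT from the paper): 64 units, 8 made-up features each,
# and a made-up normalized F0 target that depends on the first two features.
rng = np.random.default_rng(1)
X = rng.random((64, 8))
Y = 0.5 * X[:, :1] + 0.2 * X[:, 1:2]

params = init_mlp(n_in=8, n_hidden=16, n_out=1)
losses = [train_step(params, X, Y) for _ in range(300)]
f0_pred, _ = forward(params, X)  # one normalized F0 value per input unit
```

In a real system, the predicted values would be denormalized to Hz and passed to the synthesizer as pitch targets; how the paper does this is not stated in the abstract.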

Bibliographic reference. Zaki, A. / Rajouani, A. / Najim, M. (2001): "Synthesizing intonation of standard Arabic language", In EUROSPEECH-2001, 541-545.