Demo of Idlak Tangle, An Open Source DNN-Based Parametric Speech Synthesiser

Blaise Potard, Matthew P. Aylett, David A. Baude


We present a live demo of Idlak Tangle, a TTS extension to the ASR toolkit Kaldi [1]. Tangle combines the Idlak front-end and a newly released MLSA vocoder with two DNNs, which model unit durations and acoustic parameters respectively, to provide a fully functional end-to-end TTS system. The system has none of the licensing restrictions of currently available HMM-style systems, such as the HTS toolkit, and can be used free of charge for any type of application. Experimental results using the freely available SLT speaker from CMU ARCTIC show that the speech output is rated in a MUSHRA test as significantly more natural than the output of HTS-demo. The tools, audio database and recipe required to reproduce the results presented are available online (https://github.com/bpotard/idlak). The live demo will allow participants to assess the quality of TTS output on several ARCTIC voices, and on voices created from commercial-grade recordings.


Cite as

Potard, B., Aylett, M.P., Baude, D.A. (2016) Demo of Idlak Tangle, An Open Source DNN-Based Parametric Speech Synthesiser. Proc. 9th ISCA Speech Synthesis Workshop, 126-126.

Bibtex
@inproceedings{Potard+2016,
author={Blaise Potard and Matthew P. Aylett and David A. Baude},
title={Demo of Idlak Tangle, An Open Source DNN-Based Parametric Speech Synthesiser},
year=2016,
booktitle={9th ISCA Speech Synthesis Workshop},
pages={126--126}
}