A new filter design strategy to shorten the length of the filter is introduced for sinusoidal speech synthesis using cosinemodulated filter banks. Multiple sinusoidal waveforms for speech synthesis can be effectively synthesized by using pseudo-quadrature mirror filter (pseudo-QMF) banks, which are constructed by cosine modulation of the coefficients of a lowpass filter. This is because stable sinusoids are represented as sparse vectors on the subband domain of the pseudo-QMF banks and computation for the filter banks can be effectively performed with fast algorithms for discrete cosine transformation (DCT). However, the pseudo-QMF banks require relatively long filters to reduce noise caused by aliasing. In this study, a wider passband design with a perfect reconstruction (PR) QMF bank is introduced. The properties of experimentally designed filters indicated that the length of the filters can be reduced from 448 taps to 384 taps for 32-subband systems with less than -96dB errors where the computational cost for speech synthesis does not significantly increase.

DOI: `10.21437/SSW.2016-29`

Cite as

Nishizawa, N., Yazaki, T. (2016) Wide Passband Design for Cosine-Modulated Filter Banks in Sinusoidal Speech Synthesis. Proc. 9th ISCA Speech Synthesis Workshop, 178-183.

Bibtex

@inproceedings{Nishizawa+2016, author={Nobuyuki Nishizawa and Tomonori Yazaki}, title={Wide Passband Design for Cosine-Modulated Filter Banks in Sinusoidal Speech Synthesis}, year=2016, booktitle={9th ISCA Speech Synthesis Workshop}, doi={10.21437/SSW.2016-29}, url={http://dx.doi.org/10.21437/SSW.2016-29}, pages={178--183} }