13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

HMM-based Speech Synthesis Using Sub-band Basis Spectrum Model

Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine

Knowledge Media Laboratory, Corporate Research & Development Center, Toshiba Corporation, Japan

In this paper, we propose an HMM-based text-to-speech (TTS) system using the sub-band basis spectrum model (SBM). SBM represents vocal tract spectra and phase characteristics as a linear combination of sub-band basis vectors. Previous reports suggest that analysis-synthesized speech based on SBM is close to natural speech and that SBM performs effectively in text-to-speech. The SBM framework is therefore expected to improve speech quality in HMM-based TTS. Subjective experimental results show that the proposed method improves speech quality under some conditions.
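
As a rough illustration of what "a linear combination of sub-band basis vectors" can mean, the sketch below rebuilds a log spectrum from a few coefficients. It is not the authors' formulation: the triangular basis shape, the uniform band spacing, the sample-at-the-centre fitting rule, and all function names (`triangular_bases`, `analyse`, `synthesise`) are assumptions made for this example, and the phase component mentioned in the abstract is not modelled.

```python
# Illustrative only: the basis shape, band layout and fitting rule are
# assumptions for this sketch, not the definitions used in the paper.

def triangular_bases(num_bins, num_bands):
    """Return num_bands overlapping triangular basis vectors of length num_bins."""
    step = (num_bins - 1) / (num_bands - 1)  # spacing between band centres
    bases = []
    for b in range(num_bands):
        centre = b * step
        # Each basis peaks at 1 on its centre and falls to 0 at the neighbours.
        bases.append([max(0.0, 1.0 - abs(k - centre) / step)
                      for k in range(num_bins)])
    return bases


def analyse(log_spectrum, num_bands):
    """Choose one coefficient per band by sampling the spectrum at the band centre."""
    step = (len(log_spectrum) - 1) / (num_bands - 1)
    return [log_spectrum[round(b * step)] for b in range(num_bands)]


def synthesise(coeffs, bases):
    """Rebuild the spectrum as a linear combination of the basis vectors."""
    num_bins = len(bases[0])
    return [sum(c * basis[k] for c, basis in zip(coeffs, bases))
            for k in range(num_bins)]


# Toy input: a 9-bin log spectrum summarised by 5 band coefficients.
log_spectrum = [float(k) for k in range(9)]
bases = triangular_bases(9, 5)
coeffs = analyse(log_spectrum, 5)
rebuilt = synthesise(coeffs, bases)
```

With these assumed bases the reconstruction is a piecewise-linear interpolation between the band centres, so the spectrum is summarised by far fewer numbers than it has frequency bins. A low-dimensional parameter vector of this kind is the sort of feature an HMM-based synthesizer can model.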

Index Terms: speech synthesis, hidden Markov model, sub-band basis spectrum model, phase feature


Bibliographic reference. Ohtani, Yamato / Tamura, Masatsune / Morita, Masahiro / Kagoshima, Takehiko / Akamine, Masami (2012): "HMM-based speech synthesis using sub-band basis spectrum model", In INTERSPEECH-2012, 1440-1443.