Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

An HMM-Based Singing Voice Synthesis System

Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda

Nagoya Institute of Technology, Japan

The present paper describes a corpus-based singing voice synthesis system based on hidden Markov models (HMMs). The system applies HMM-based speech synthesis to the synthesis of singing voice. Musical information such as lyrics, tones, and durations is modeled simultaneously in a unified framework of context-dependent HMMs. The system can mimic the voice quality and singing style of the original singer. Results of a singing voice synthesis experiment show that the proposed system can synthesize a smooth and natural-sounding singing voice.


Bibliographic reference. Saino, Keijiro / Zen, Heiga / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi (2006): "An HMM-based singing voice synthesis system", In INTERSPEECH-2006, paper 2077-Thu1BuP.7.