Speech Prosody 2004

Nara, Japan
March 23-26, 2004

Input Prediction Method of Speech Front End Processor Using Prosodic Information

Masahiro Araki, Hiroyoshi Ohmiya, Satoshi Kida

Department of Electronics and Information Science, Kyoto Institute of Technology, Japan

In general, prosody of speech contains various information. For example, in Japanese, accent information is used for distinguishing homonyms and identifying word boundaries. In this paper, we propose a combination method of phonetic and prosodic information in speech applications, that is, an input prediction front end processor for dictation. From a few morae inputs, completion candidates that are sorted by input history and by the accent pattern are listed up. We examined two accent usage methods for both registered words and unregistered words and implemented an input prediction system combining a speech recognizer, a prediction server and an accent usage module.

Full Paper

Bibliographic reference.  Araki, Masahiro / Ohmiya, Hiroyoshi / Kida, Satoshi (2004): "Input prediction method of speech front end processor using prosodic information", In SP-2004, 501-504.