ISCA Archive ICSLP 1990 Sessions

First International Conference on Spoken Language Processing

Kobe, Japan
18-22 November 1990

Speech Recognition and Enhancement

Enhanced parametric representation using binarized spectrum
Ki Chul Kim, Hyun Soo Yoon, Jung Wan Cho

Voiced-unvoiced classification using weighted distance measures
Kiyoshi Asai, Shigeru Chiba

Phoneme recognition using a hierarchical time spectrum pattern
Kei Miki

Recognition of plosive using mixed features by Fisher's linear discriminant
Susumu Sato, Takeshi Fukabayashi

Clustering algorithms to minimize recognition error function and their applications to the vowel template learning
Akio Ando, Kazuhiko Ozeki

Chinese four tone recognition based on the model for process of generating F0 contours of sentences
Changfu Wang, Hiroya Fujisaki, Keikichi Hirose

Generalized training of hidden Markov model parameters for speech recognition
N. S. Kim, Chong Kwan Un

Phoneme recognition by combining Bayesian linear discriminations of selected pairs of classes
Tatsuya Kawahara, Toru Ogawa, Shigeyoshi Kitazawa, Shuji Doshita

WAL - a speech recognition programming language
S. Atkins, P. Kenne, D. Landy, S. Nulsen, M. O'Kane

Automatic segmentation: why and what segments?
Mario Rossi

A Japanese text dictation system based on phoneme recognition using a modified LVQ2 method
Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido

An optimal discriminative training method for continuous mixture density HMMs
Shinobu Mizuta, Kunio Nakajima

Discrimination of words in a large vocabulary speech recognition system
S. Datta, M. Al-Zabibi

A recognition time reduction algorithm for large-vocabulary speech recognition
J. M. Koo, Chong Kwan Un, Hwang Soo Lee, H. R. Kim, M. W. Koo

Speech recognition method based on the dual processing nature of speech perception
Hyung Soon Kim, Chong Kwan Un

Speaker adaptation for demi-syllable based speech recognition using continuous HMM
Koichi Shinoda, Ken-ichi Iso, Takao Watanabe

Speech signal processing on a neurocomputer
Toby Skinner

Syllable structure parsing for continuous speech recognition
Shigeru Ono

An accelerator for high-speed spoken word-spotting and noise immunity learning system
Hiroyuki Tsuboi, Hiroshi Kanazawa, Yoichi Takebayashi

Recognition of standard Malaysian language pronunciation
Zainul Abidin Md. Sharrif, Masuri Othman, Mohammad Ibrahim AKB Maiden

The SAPHA acoustic-phonetic decoder system for standard Arabic
M. Djoudi, Jean-Paul Haton

A concept for a cocktail-party-processor
Markus Bodden

Remote control system using speech-reduction of known noise
Tsuyoshi Usagawa, Yuji Morita, Masanao Ebata

Lombard speech recognition by formant-frequency-shifted LPC cepstrum
Yumi Takizawa, Masahiro Hamada

A robust distance measure based on group delay difference weighted by power spectra
Hiroshi Matsumoto, Hirokazu Mitsui

Speech enhancement using group delay functions
B. Yegnanarayana, Hema A. Murthy, V. R. Ramachandran

Recovery of reverberated speech using multi-microphone sub-band envelope estimation
Hong Wang, Fumitada Itakura

DISPE: a divers' speech data-base
Alain Marchal, Marie-Hélène Casanova, P. Gavarry, M. Avon

Synthesis of Spoken Language

Lexical components in rule-based speech systems
Rolf Carlson, Björn Granström, Sheri Hunnicutt

The integration of linguistic levels in a text-to-speech conversion system
Ken Ceder, Bertil Lyberg

The linguistic processing module for Japanese text-to-speech system
Tohru Shimizu, Norio Higuchi, Hisashi Kawai, Seiichi Yamamoto

A neural network approach to multi-language text-to-speech system
Yukiko Yamaguchi, Tatsuro Matsumoto

Proposal and evaluation of a new type of terminal analog speech synthesizer
Hiroya Fujisaki, Keikichi Hirose, Yasuharu Asano

The interrelationship of intelligibility and naturalness in text-to-speech
Bathsheba J. Malsheen, Mariscela Amador-Hernandez

Segment selection and pitch modification for high quality speech synthesis using waveform segments
Tomohisa Hirokawa, Kazuo Hakoda

On the unit search criteria and algorithms for speech synthesis using non-uniform units
Kazuya Takeda, Katsuo Abe, Yoshinori Sagisaka

Speech synthesis using superposition of sinusoidal waves generated by synchronized oscillators
Katsuhiko Shirai, Y. Sato, K. Hashimoto

Time-frequency spectral analysis of speech
David Rainton, S. J. Young

Inductive learning of grapheme-to-phoneme rules
Bert Van Coile

A support environment based on rule interpreter for synthesis by rule
Yoichi Yamashita, Hiroyuki Fujiwara, Yasuo Nomura, Nobuyoshi Kaiki, Riichiro Mizoguchi

Speech synthesis using demisyllables for Korean: a preliminary system
Jung-Chul Lee, Yong-Ju Lee, Hee-il Han, Eung-Bae Kim, Chang-Joo Kim, Kyung-Tae Kim

The rules in a Korean text-to-speech system
Seung-Kwon Ahn, Koeng-Mo Sung

Mandarin speech synthesis by the unit of coarticulatory demi-syllable
Chi-Shi Liu, Wern-Jun Wang, Shiow-Min Yu, Hsiao-Chuan Wang

A study on various prosody styles in Japanese speech synthesizable with the text-to-speech system
Ryunen Teranishi

Japanese text-to-speech conversion system
Hiroki Kamanaka, Takashi Yazu, Keiichi Chihara, Makoto Morito

Neural network based concatenation method of synthesis units for synthesis by rule
Yasushi Ishikawa, Kunio Nakajima

Improvement of the synthetic speech quality of the formant-type speech synthesizer and its subjective evaluation
Norio Higuchi, Hisashi Kawai, Tohru Shimizu, Seiichi Yamamoto

A parametric model of speech signals: application to high quality speech synthesis by spectral and prosodic modifications
Thierry Galas, Xavier Rodet

The improved source model for high-quality synthetic speech sound
Tomoki Hamagami, Shinichiro Hashimoto

A new Japanese text-to-speech synthesizer based on COC synthesis method
Kazuo Hakoda, Shin-ya Nakajima, Tomohisa Hirokawa, Hideyuki Mizuno

A parallel multialgorithmic approach for an accurate and fast English text to speech transcriber
G. M. Asher, K. M. Curtis, J. Andrews, J. Burniston

A highly programmable formant speech synthesiser utilising parallel processors
K. M. Curtis, G. M. Asher, S. E. Pack, J. Andrews

Enhancement of human-computer interaction through the synthesis of nonverbal expressions
Kris Maeda, Yasuki Yamashita, Yoichi Takebayashi

Duration, pitch and diphones in the CSTR TTS system
W. Nick Campbell, Stephen D. Isard, Alex I. C. Monaghan, J. Verhoeven

A Chinese fundamental frequency synthesizer based on a statistical model
Sin-Horng Chen, Su-Min Lee, Saga Chang

A contribution to the synthesis of Italian intonation
Cinzia Avesani

Pause rule for Japanese text-to-speech conversion using pause insertion probability
Kazuhiko Iwata, Yukio Mitome, Takao Watanabe

Analysis and modeling of tonal features in polysyllabic words and sentences of the standard Chinese
Hiroya Fujisaki, Keikichi Hirose, Pierre Halle, Haitao Lei

Voice response unit embedded in factory automation systems
Akira Yamamura, Hiroharu Kunizawa, Noboru Ueji, Hiroshi Itoyama, Osamu Kakusho

Tetos - a text-to-speech system for German
Klaus Wothke

A written text processing expert system for text to phoneme conversion
Michel Divay

Trial production of a module for speech synthesis by rule
Mikio Yamaguchi

Speech Production, Prosody and Analysis

MRI (magnetic resonance imaging) film of articulatory movements
A. K. Foldvik, O. Husby, J. Kvaerness, I. C. Nordli, P. A. Rinck

Modeling of 3-dimensional vocal tract shapes obtained by magnetic resonance imaging for speech synthesis
Masafumi Matsumura, Atsushi Sugiura

Ultrasonic measurement of tongue motion
Tokihiko Kaburagi, Masaaki Honda

Measurement of sound wave characteristics in the vocal tract
Kunitoshi Motoki, Nobuhiro Miki, Nobuo Nagai

Speech production model involving subglottal structure and oral-nasal coupling through closed velum
Hisayoshi Suzuki, Takayoshi Nakai, Jiauwu Dang, Chengxiang Lu

Articulatory characteristics of lip shape during the production of Japanese
Yorinobu Sonoda, Keisuke Mori, Tetsuaki Kuriyama

Sequential control model of speech articulation in producing word utterance
Naoki Kusakawa, Kiyoshi Honda, Yuki Kakita

Sternohyoid muscle activity and pitch control at the onset of utterances
Zyun'ici B. Simada, Satoshi Horiguchi, Seiji Niimi, Hajime Hirose

Prosodic features marking the major syntactic boundary of Japanese: a study on syntactically ambiguous sentences of the Kinki dialect
Junichi Azuma, Yoshimasa Tsukuma

Automatic segmentation and alignment of continuous speech based on temporal decomposition model
H. D. Wang, Gérard Bailly, D. Tuffelli

Voiced/unvoiced/silence classification of spoken Korean
Hee-Il Hahn, Minsoo Hahn

Vocal pauses in teaching: statistical analysis and applications
E. Angderi, M. Barsotti, L. Mazzei, L. Vttrano, R. Volpentesta

A pitch detector based on event detection using the dyadic wavelet transform
Shubha Kadambe, Gloria F. Boudreaux-Bartels

Proposal and evaluation of a new scheme for reliable pitch extraction of speech
Hiroya Fujisaki, Keikichi Hirose, Shigenobu Seto

Spectral interpolation using distortion geodesic lines
Masahide Sugiyama

Adaptive speech processing using an accelerated stochastic approximation method
Hirofumi Yogo, Naoki Inagaki

Perception, Impairments/Aids, Phonetics in Language Teaching and Speech Coding

Effects of temporal factors on the speech perception of the hearing impaired
Akiko Hayashi, Satoshi Imaizumi, Takehiko Harada, Hideaki Seki, Hiroshi Hosoi

Analysis of temporal coordination between articulatory movements and pitch control in the realization of Japanese word accent by a patient with apraxia of speech
Shinobu Masaki, Itaru F. Tatsumi, Sumiko Sasanuma

Multiband dynamic range compression sound processing for hearing impaired patients: effect on intelligibility of speech in background noise
Brian C. J. Moore, Jeannette Seloover Johnson, Vincent Pluvinage, Teresa M. Clark

New graphical expression of the high-speed palatographic data in study of the articulatory behaviors of the tongue
Takao Mizutani, Kiyoshi Hashimoto, Masahiko Wakumoto, Ken-ichi Michi, Hareo Hamada, Tanetoshi Miura

Aging in the rate and regularity of maximum syllable repetition under bite-block
Makoto Kariyasu, Kukiko Maruyama

Vowel quantity contrast in Korean: production and perception
Minje Zhi, Yong-Ju Lee

Phonetic correlates of stress in Mongolian
Jan-Olof Svantesson

Syllable final stops in East Asian languages: southern Chinese, Thai and Korean
Ray Iwata, Hajime Hirose, Seiji Niimi, Masayuki Sawashima, Satoshi Horiguchi

An electromyographic study on laryngeal adjustment for production of the light tone in Mandarin Chinese
Seiji Niimi, Qun Yan, Satoshi Horiguchi, Hajime Hirose

A comparison of the articulation of the Chinese /i,l,l/ by Chinese and Japanese speakers
Jingxu Cui, Shuichi Itahashi

The durations of Japanese long vowels and geminated consonants uttered by Indonesian
Hirotake Nakashima, Masao Yamaguchi

On phrasing of Japanese language learners
Izumi Saita

PROTS (pronunciation training system)
Kawai Musical Instruments

Constrained-stochastic excitation coding of speech at 4.8 kb/s
Yair Shoham

Adaptive transform coding with an adaptive block size using a modified DCT
Fumie Hazu, Akihiko Sugiyama, Masahiro Iwadare, Takao Nishitani

Medium-delay 8 kbit/s speech coder based on conditional pitch prediction
Takehiro Moriya

A low rate VQ speech coding algorithm with variable transmission frame length
Sung Ro Lee, Hwang Soo Lee, Chong Kwan Un

Neural Networks for Speech Processing I, II

Speech recognition using demi-syllable neural prediction model
Ken-ichi Iso, Takao Watanabe

Phonetic features extraction using time-delay neural networks
Frédéric Bimbot, Gerard Chollet, Jean-Pierre Tubach

Vowel recognition by phoneme filter neural networks
Masami Nakamura, Shinichi Tamura

A comparison of two methods to transcribe speech into phonemes: a rule-based method vs. back-propagation
Kari Torkkola, Mikko Kokkonen

Phoneme recognition by pairwise discriminant TDNNs
Jun-Ichi Takami, Shigeki Sagayama

Speaker independent speech recognition based on neural networks of each category with embedded eigenvectors
Yasuyuki Masai, Hiroshi Matsu'ura, Tsuneo Nitta

Speech recognition using sub-phoneme recognition neural network
Kiyoaki Aikawa, Alexander H. Waibel

Speech recognition based on the integration of FSVQ and neural network
Li-Qun Xu, Tie-Cheng Yu, G. D. Tattersall

Fast text-to-speech learning
Samir I. Sayegh

Continuous speech recognition on the resource management database using connectionist probability estimation
Nelson Morgan, C. Wooters, Hervé Bourlard, Michael Cohen

Neural predictive hidden Markov model
Eiichi Tsuboka, Yoshihiro Takada, Hisashi Wakita

On the robustness of HMM and ANN speech recognition algorithms
Yasuhiro Minami, Toshiyuki Hanazawa, Hitoshi Iwamida, Erik McDermott, Kiyohiro Shikano, Shigeru Katagiri, Masaona Kagawa

The TDNN-LR large-vocabulary and continuous speech recognition system
Hidefumi Sawai

Rule-driven neural networks for acoustic-phonetic decoding
Remy Bulot, Henri Meloni, Pascal Nocera

Knowledge-based segmentation and feature maps for speech recognition
Franck Poirier

Speaker-independent English alphabet recognition: experiments with the e-set
Mark Fanty, Ron Cole

Neural network based segmentation of continuous speech
Pinaki Poddar, P. V. S. Rao

A normalization of coarticulation of connected vowels using neural network
Tomio Takara, Motonori Tamaki

Lip-reading of Japanese vowels using neural networks
Tomio Watanabe, Masaki Kohda

Application of the compositional representation to lexical access using neural networks
H. Lucke, Frank Fallside

A voice input-output system using isolated words
Abdul Mobin, S. S. Agrawal, Anil Kumar, K. D. Pavate

A psycholinguistic model of first and second language learning
Tatiana Slama-Cazacu

Assessment / Human Factors, Database and Neural Networks

A note on loud and Lombard speech
Zinny S. Bond, Thomas J. Moore

A weighted intelligibility measure for speech assessment
Ute Jekosch

Improvements in binaural articulation score by simulated localization using head-related transfer functions
Shinji Hayashi

Evaluating synthesiser performance: is segmental intelligibility enough?
Kim Silverman, Sara Basson, Suzi Levas

Media conversion into language and voice for intelligent communication
Fumio Maehara, Masamichi Nakagawa, Kunio Nobori, Toshiyuki Maeda, Tsutomu Mori, Makoto Fujimoto

Segmental intelligibility of synthetic and natural speech in real and nonsense words
Rolf Carlson, Björn Granström, Lennart Nord

The HKU-USTC speech corpus
Chorkin Chan, Ren-hua Wang

Automatic alignment of phonemic labels with continuous speech
Torbjorn Svendsen, Knut Kvale

TELS: a speech time-expansion labelling system
D. Tuffelli, H. D. Wang

A speech labeling system based on knowledge processing
Kazuhiro Arai, Yoichi Yamashita, Tadahiro Kitahashi, Riichiro Mizoguchi

Development and experimental use of PHONWORK, a new phonetic workbench
Hans G. Tillmann, Maximilian Hadersbeck, Hans Georg Piroth, Barbara Eisen

A speech recognition research environment based on large-scale word and concept dictionaries
Hiroyuki Chimoto, Hideaki Shinchi, Hideki Hashimoto, Shinya Amano

Are laboratory databases appropriate for training and testing telephone speech recognizers?
Benjamin Chigier, Judith Spitz

Standardisation of speech input assessment within the SAM ESPRIT project
Sven W. Danielsen

Multilingual speech data base for evaluating quality of digitized speech
Hiroshi Irii, Kenzo Ito, Nobuhiko Kitawaki

The optimal gain sequence for fastest learning in connectionist vector quantiser design
Lizhong Wu, Frank Fallside

A comparison of preprocessors for the Cambridge recurrent error propagation network speech recognition system
Tony Robinson, John Holdsworth, Roy Patterson, Frank Fallside

A recurrent neural network for word identification from phoneme sequences
R. B. Allen, C. Kamm, S. B. James

Improved broad phonetic classification and segmentation with a neural network and a new auditory model
Lieven Depuydt, Jean-Pierre Martens, Luc Van Immerseel, Nico Weymaere

Formant extraction model by neural networks and auditory model based on signal processing theory
Kazuaki Obara, Hideyuki Takagi

/b,d,g/ recognition with elliptic discrimination neural units
Noboru Kanedera, Tetsuo Funada

A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons
Helen M. Meng, Victor W. Zue

Extended Elman's recurrent neural network for syllable recognition
Yong Duk Cho, Ki Chul Kim, Hyun Soo Yoon, Seung Ryoul Maeng, Jung Wan Cho

Detection and classification of phonemes using context-independent error back-propagation
Hong C. Leung, James R. Glass, Michael S. Phillips, Victor W. Zue

A new method of consonant detection and classification using neural networks
Shigeru Chiba, Kiyoshi Asai

An artificial neural network for the burst point detection
Shigeyoshi Kitazawa, Masahiro Serizawa

The use of discriminant neural networks in the integration of acoustic cues for voicing into a continuous-word recognition system
Claude Lefebvre, Dariusz A. Zwierzynski

A neural network for speaker-independent isolated word recognition
Kouichi Yamaguchi, Kenji Sakamoto, Toshio Akabane, Yoshiji Fujimoto

Continuous Speech Recognition and Speaker Recognition

Vocabulary independent phrase recognition with a linear phonetic context model
Yoshiharu Abe, Kunio Nakajima

Phoneme probability presentation of continuous speech
Y. Ariki, Mervyn A. Jack

Duration constraints for the speech input interface in the MULTIWORKS project
Haiyan Ye, Jean Caelen

Chinese continuous speech recognition system using the state transition models both of phonemes and words
Hu Zhi-ping, Imai Satoshi

A new training method for multi-phone speech units for use in a hidden Markov model speech recognition system
Jade Goldstein, Akio Amano, Hideki Murayama, Mariko Izawa, Akira Ichikawa

Diction for phoneme/syllable/word-category and identification of language using HMM
Yoshio Ueda, Seiichi Nakagawa

Performance evaluation in speech recognition system using transition probability between linguistic units
Takashi Otsuki, Shozo Makino, Toshio Sone, Ken'iti Kido

Sentence recognition method using word cooccurrence probability and its evaluation
Isao Murase, Seiichi Nakagawa

A knowledge-based understanding system for the Chinese spoken language
Yanghai Lu, Beiqian Dai

Conversational speech understanding based on cooperative problem solving
Akio Komatsu, Eiji Oohira, Akira Ichikawa

A one-pass search algorithm for continuous speech recognition directed by context-free phrase structure grammar
Michio Okada

A blackboard architecture for a word hypothesizer and a chart parser interaction in an ASR system
Andrea Di Carlo, Rino Falcone

Heuristic search problems in a natural language task oriented spoken man-machine dialogue system
P. Mousel, Jean-Marie Pierrel, A. Roussanaly

The making of a speech-to-speech translation system: some findings from the DmDialog project
Hiroaki Kitano

Using high level knowledge sources as a means of recovering ill-formed Japanese sentences distorted by ambient noise
K. H. Loken-Kim, Yasuhiro Nara, Shinta Kimura

Tools for designing dialogues in speech understanding interfaces
Anders Baekgaard, Paul Dalsgaard

A method for expressing associative relations using fuzzy concepts -aiming at advanced speech recognition-
Osamu Takizawa, Masuzo Yanagida

Bilingual speech interface for a bidirectional machine translation system
Jean-Pierre Tubach, Raymond Descout, Pierre Isabelle

Optimum spectral peak track interpretation in terms of formants
Yves Laprie

A speech understanding system
Thierry Spriet

Speaker based on multipulse excitation and LPC vocal-tract model
Seiichiro Hangai, Kazuhiro Miyauchi

A neural network based speaker verification system
I-Chang Jou, Su-Ling Lee, Min-Tau Lin, Chih-Yuan Tseng, Shih-Shien Yu, Yuh-Juain Tsay

Speaker recognition using static and dynamic cepstral feature by a learning neural network
Hujun Yin, Tong Zhou


Temporal Control in the Spoken Language

Speech Analysis

Voice Source Dynamics; Facts and Models

Speech Coding and Transmission

Extraction and Processing of Voice Individuality

Voice Source Characteristics and Synthesis

Speech Recognition and Enhancement

Synthesis of Spoken Language

Phoneme Recognition

Recent Progress in Speech Perception Research

Speech Production, Prosody and Analysis

The Role of Prosody in Production and Perception of Spoken Language

Word Recognition

Perception of Spoken Language

Perception, Impairments/Aids, Phonetics in Language Teaching and Speech Coding

Neural Networks for Speech Processing I, II

Continuous Speech Recognition

Modeling of First and Second Language Acquisition

Application of Speech Recognition / Synthesis Technologies

Language Modeling

Phonetics and Phonology

Assessment / Human Factors, Database and Neural Networks

Speech I/O Assessment and Database I, II

Speech Recognition in Noisy Environments

Foreign Language Teaching

Continuous Speech Recognition and Speaker Recognition

Dialogue Modeling and Processing

Language Acquisition

Plenary Lectures