ISCA Archive ICSLP 1994 Sessions
  ISCA Archive Sessions
top

3rd International Conference on Spoken Language Processing

Yokohama, Japan
18-22 September 1994







Statistical Methods for Speech Recognition


Speech recognition using HMM with decreased intra-group variation in the temporal structure
Nobuaki Minematsu, Keikichi Hirose

Spoken word recognition using phoneme duration information estimated from speaking rate of input speech
Yukihiro Osaka, Shozo Makino, Toshio Sone

State duration constraint using syllable duration for speech recognition
Yumi Wakita, Eiichi Tsuboka

Statistical modeling and recognition of rhythm in speech
Satoru Hayamizu, Kazuyo Tanaka

Recognition of Chinese tones in monosyllabic and disyllabic speech using HMM
Xinhui Hu, Keikichi Hirose

Chinese speech understanding and spelling-word translation based on the statistics of corpus
Jun Wu, Zuoying Wang, Jiasong Sun, Jin Guo

State-codebook based quasi continuous density hidden Markov model with applications to recognition of Chinese syllables
Ren-Hua Wang, Hui Jiang

Estimating linear discriminant parameters for continuous density hidden Markov models
Eluned S. Parris, Michael J. Carey

Discriminative state-weighting in hidden Markov models
F. Wolfertstetter, Günther Ruske

Speech recognition using tree-structured probability density function
Takao Watanabe, Koichi Shinoda, Keizaburo Takagi, Eiko Yamada

Prediction of word confusabilities for speech recognition
David B. Roe, Michael D. Riley

A comparison study of output probability functions in HMMs through spoken digit recognition
Li Zhao, Hideyuki Suzuki, Seiichi Nakagawa

Connected spoken word recognition using a many-state Markov model
Tomio Takara, Naoto Matayoshi, Kazuya Higa

Global optimisation of HMM input transformations
Finn Tore Johansen

Nonstationary-state hidden Markov model with state-dependent time warping: application to speech recognition
D. Sun, L. Deng

Automatic word recognition based on second-order hidden Markov models
Jean-Francois Mari, Jean-Paul Haton

On the application of multiple transition branch hidden Markov models to Chinese digit recognition
Xixian Chen, Yinong Li, Xiaoming Ma, Lie Zhang

Parallel model combination on a noise corrupted resource management task
M. J. F. Gales, Steve J. Young

Robust signal preprocessing for HMM speech recognition in adverse conditions
Jean-Baptiste Puel, Regine André-Obrecht

A study on viterbi best-first search for isolated word recognition using duration-controlled HMM
Masaharu Katoh, Masaki Kohda

An HMM duration control algorithm with a low computational cost
Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano

Fast log-likelihood computation for mixture densities in a high-dimensional feature space
Peter Beyerlein

Time synchronous heuristic search in a stochastic segment based recognizer
Nick Cremelie, Jean-Pierre Martens

Applying speech verification to a large data base of German to obtain a statistical survey about rules of pronunciation
Maria-Barbara Wesenick, Florian Schiel

Structure of allophonic models and reliable estimation of the contextual parameters
D. Jouvet, Katarina Bartkova, A. Stouff

A probabilistic framework for word recognition using phonetic features
Christoph Windheuser, Frédéric Bimbot, Patrick Haffner

Nonlinear time alignment in stochastic trajectory models for speech recognition
Mohamed Afify, Yifan Gong, Jean-Paul Haton

Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities
David M. Lubensky, Ayman O. Asadi, Jayant M. Naik

A trellis-based implementation of minimum error rate training
Kazuya Takeda, Tetsunori Murakami, Shingo Kuroiwa, Seiichi Yamamoto

Concatenated training of subword HMMs using detected labels
Me Yi

An initial study on speaker adaptation for Mandarin syllable recognition with minimum error discriminative training
Chih-Heng Lin, Pao-Chung Chang, Chien-Hsing Wu


Phonetics & Phonology I, II


Phonetic underspecification in schwa
Yuko Kondo

Some remarks on the compound accent rule in Japanese
Shin'ichi Tanaka, Haruo Kubozono

Modifications of acoustic features in Russian connected speech
Rodmonga K. Potapova

A prosodic analysis of three sentence types with "WH" words in Korean
Sun-Ah Jun, Mira Oh

Distinguishing the voiceless fricatives f and TH in English: a study of relevant acoustic properties
Kazue Hata, Heather Moran, Steve Pearson

Correlation analysis between speech power and pitch frequency for twenty spoken languages
Kenzo Itoh

On gestural reduction and gestural overlap in Korean and English /PK/ clusters
Jongho Jun

Intonation contours and the prominence of F0 peaks
Carlos Gussenhoven, Toni Rietveld

Phonation types analysis in standard Chinese
Agnès Belotel-Grenié, Michel Grenié

Accent phrase segmentation by finding n-best sequences of pitch pattern templates
Mitsuru Nakai, Hiroshi Shimodaira

Sound similarity judgments and segment prominence: a cross-linguistic study
Bruce L. Derwing, Terrance M. Nearey

Analysis of accent and intonation in Spanish based on a quantitative model
Hiroya Fujisaki, Sumio Ohno, Kei-ichi Nakamura, Miguelina Guirao, Jorge Gurlekian

Italian clusters in continuous speech
Edda Farnetani, M. Grazia Busa

Rhythmic constraints in durational control
Cynthia Grover, Jacques Terken

Further evidence for bi-moraic foot in Japanese
Kazutaka Kurisu

A model for generating self-repairs
Yuji Sagawa, Masahiro Ito, Noboru Ohnishi, Noboru Sugie

Accent identification with a view to assisting recognition (work in progress)
Chris Cleirigh, Julie Vonwiller

Phonetic, phonological, morpho-syntactic and semantic functons of segmental duration in spoken telugu: acoustic evidence
K. Nagamma Reddy

Timing strategies within the paragraph
Zita McRobbie-Utasi

The effect of the following vowel on the frequency normalization in the perception of voiceless stop consonants
Sotaro Sekimoto

Features of prominent particles in Japanese discourse, frequency, functions and acoustic features
Toshiko Muranaka, Noriyo Hara

Vowel quality assessment based on analysis of distinctive features
Shuping Ran, Bruce Millar, Iain Macleod

Differences in the fluctuation of attention during the listening of natural and synthetic passages
Cristina Delogu, Stella Conte, Giro Sementina

Production and perception of words with identical segmental structure but different number of syllables
Barbara Heuft, Thomas Portele

Generation of pronunciations from orthographies using transformation-based error-driven learning
Caroline B. Huang, Mark A. Son-Bell, David M. Baggett

Characteristics of mispronunciation and hesitation in Japanese tongue twister
Hidenori Usuki, Jouji Suzuki, Tetsuya Shimamura

A duration study of speech vowels produced in noise
Jean-Claude Junqua

PROTRAN: a prosody transplantation tool for text-to-speech applications
Bert Van Coile, L. Van Tichelen, Annemie Vorstermans, J. W. Jang, M. Staessen

Complementary phonology a theoretical frame for labelling an acoustic data base of dialogues
Klaus J. Kohler

Distribution of devoiced high vowels in Korean
Sun-Ah Jun, Mary E. Beckman

CV as a phonological unit in Korean
Yeo Bom Yoon

Experiments on the syllable in hindi
Manjari Ohala

Towards a universal, phonetically-based, theory of vowel harmony
John J. Ohala

Perceptual parsing of nasal vowels
John Ingram, Tom Mylne

On the perceptual distance between speech segments
Oded Ghitza, M. Mohan Sondhi

Perception of central vowel with pre- and post-anchors
Masato Akagi, Astrid van Wieringen, Louis C. W. Pols

Phonological mechanisms of French speech errors
Mario Rossi, Evelyne Peter-Defare, Regine Vial

Phonetic prototypes: modelling the effects of speaking rate on the internal structure of a voiceless category using recurrent neural networks
Mukhlis Abu-Bakar, Nick Chater

EPG and acoustic study of some connected speech processes
William J. Hardcastle

Syllable timing computation in the c/d model
Osamu Fujimura

Contribution of psycholinguistic perspective for speech technologies
Tatiana Slama-Cazacu


Adaption and Training for Speech Recognition


An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation
Yutaka Tsurumi, Seiichi Nakagawa

Unsupervised speaker adaptation for speech recognition using demi-syllable HMM
Koichi Shinoda, Takao Watanabe

Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
W. Chou, C.-E. Lee, Biing-Hwang Juang

Incremental speaker adaptation using phonetically balanced training sentences for Mandarin syllable recognition based on segmental probability models
Jia-lin Shen, Hsin-min Wang, Ren-yuan Lyu, Lin-shan Lee

Incremental training of a speech recognizer for voice dialling-by-name
Lorenzo Fissore, Giorgio Micca, F. Ravera

Speaker adaptation of continuous density HMMs using multivariate linear regression
C. J. Leggetter, Phil C. Woodland

Speaker adaptation based on transfer vectors of multiple reference speakers
Kazumi Ohkura, Hiroki Ohnishi, Masayuki Iida

Experiments with a new algorithm for fast speaker adaptation
Nikko Strom

A study of applying adaptive learning to a multi-module system
Tung-Hui Chiang, Yi-Chung Lin, Keh-Yih Su

Speaker adaptation based on fuzzy vector quantization
Jun'ichi Nakahashi, Eiichi Tsuboka

A study on the simulated annealing of self organized map algorithm for Korean phoneme recognition
Myung-Kwang Kong, Seong-Kwon Lee, Soon-Hyob Kim

Discriminative training of garbage model for non-vocabulary utterance rejection
Celinda de la Torre, Alejandro Acero


Science and Technology for Multimodal Interfaces


Listener eye movement behavior during audiovisual speech perception
Eric Vatikiotis-Bateson, Inge-Marie Eigsti, Sumio Yano

Auditory/visual speech in multimodal human interfaces
Dominic W. Massaro, Michael M. Cohen

Effects of phonological and semantic information of kanji and kana characters on speech perception
Tadahisa Kondo, Kazuhiko Kakehi

Human processing of auditory-visual information in speech perception: potential for multimodal human-machine interfaces
Patricia K. Kuhl, Minoru Tsuzaki, Yoh'ichi Tohkura, Andrew N. Meltzoff

Visual perception of human bodies and faces for multi-modal interfaces
Alex P. Pentland, Trevor Darrell

See me, hear me: integrating automatic speech recognition and lip-reading
Paul Duchnowski, Uwe Meier, Alex Waibel

Integration themes in multimodal human-computer interaction
Sharon Oviatt, Erik Olsen

A multimodal teleconferencing system using hands-free voice control
D. A. Berkley, James L. Flanagan, K. L. Shipley, Lawrence R. Rabiner

Exploring the relation between mcgurk interference and ventriloquism
Paul Bertelson, Jean Vroomen, Geert Wiegeraad, Beatrice de Gelder

Naturalness of the interaction in multimodal applications
Jean-Claude Junqua, Philippe Morin

Evaluation of multimodal interface using spoken language and pointing gesture on interior design system
Haru Ando, Yoshinori Kitahara, Nobuo Hataoka

Linguistic and paralinguistic differences between multimodal and telephone-only dialogues
Kyung-ho Loken-Kim, Fumihiro Yato, Laurel Fais, Tsuyoshi Morimoto, Akira Kurematsu


Measurements and Models of Speech Production


An investigation of the potential role of speech production models in automatic speech recognition
R. C. Rose, Juergen Schroeter, M. Mohan Sondhi

A trajectory formation model of articulatory movements based on the motor tasks of phoneme-specific vocal tract shapes
Tokihiko Kaburagi, Masaaki Honda

Articulatory trajectories generated by the control of the vocal tract by a neural network
Martine George, Paul Jospa, Alain Soquet

Neural network prediction of lip shape from muscle EMG in Japanese speech
Makoto Hirayama, Eric Vatikiotis-Bateson, Vincent Gracco, Mitsuo Kawato

Estimation of the lateral shape of a tongue from speech
Masahiro Hiraike, Shigehisa Shimizu, Takao Mizutani, Kiyoshi Hashimoto

The acoustic-articulatory mapping and the variational method
Paul Jospa, Alain Soquet

Aerodynamical, geometrical and mechanical aspects of bilabial plosives production
X. Pelorson, T. Lallouache, S. Tourret, C. Bouffartigue, Pierre Badin

Investigation of the acoustic characteristics of the velum for vowels
Jianwu Dang, Kiyoshi Honda

Measurement of acoustic impedance density distribution in the near field of the labial horn
Kunitoshi Motoki, Pierre Badin, Nobuhiro Miki

Explicit relations between resonance frequencies and vocal tract cross sections in loss-less kelly-lochbaum and distinctive region vocal tract models
Jean Schoentgen, S. Ciocea

Improving the kelly-lochbaum vocal tract model using conical tube sections and fractional delay filtering techniques
Vesa Välimäki, Matti Karjalainen

Measurement of 3d shapes of vocal tract, dental crown and nasal cavity using MRI: vowels and fricatives
Masafumi Matsumura, Takuya Nukawa, Koji Shimizu, Yasuji Hashimoto, Tatsuya Morita

Accurate measurement of vocal tract shapes from magnetic resonance images of child, female and male subjects
Chang-Sheng Yang, Hideki Kasuya

An MRI study of fricative consonants
Shrikanth Narayanan, Abeer Alwan, Katherine Haker

Phoneme extraction using via point estimation of real speech
Eric Vatikiotis-Bateson, Mark K. Tiede, Y. Wada, Vincent Gracco, Mitsuo Kawato

3d FEM analysis of vocal tract model of elliptic tube with inhomogeneous-wall impedance
Hiroki Matsuzaki, Nobuhiro Miki, Nobuo Nagai, Tohru Hirohku, Yoshihiko Ogawa

Chaotic characteristics of voice fluctuation and its model explanation: normal and pathological voices
Yuki Kakita, Hitoshi Okamoto

Flow theory for analysis of phonation with a membrane model of vocal cord
Tadashige Ikeda, Yuji Matsuzaki

Real-time processing of electroglottographic waveforms for the evaluation of phonation types
B. Craig Dickson, John H. Esling, Roy C. Snell

Global pitch range and the production of low tones in English intonation
Donna Erickson, Kiyoshi Honda, Hiroyuki Hirai, Mary E. Beckman, Seiji Niimi

Measument of palatolingual contact pressure during consonant productions using strain gauge transducer mounted platal plate
Masafumi Matsumura, Kazuo Kimura, Katsumi Yoshino, Takashi Tachimura, Takeshi Wada

A study of sensor arrangements for detecting movements and inclinations of tongue point during speech
Kohichi Ogata, Yorinobu Sonoda

Estimation of temporal processing unit of speech motor programming for Japanese words based on the measurement of reaction time
Shinobu Masaki, Kiyoshi Honda



Speech Synthesis I, II


Generating prosodic structure for Swedish text-to-speech
Merle Horne, Marcus Filipsson

Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input
Alan W. Black, Paul Taylor

Segmental effects on timing and height of pitch contours
Jan P. H. van Santen, Julia Hirschberg

A study on pitch pattern generation using HMM-based statistical information
Toshiaki Fukada, Yasuhiro Komori, Takashi Aso, Yasunori Ohora

Using a hybrid model in a text-to-sppech system to enlarge prosodic modifications
O. Boeffard, F. Violaro

A new method for estimating Japanese speech rate
Akio Ando, Eiichi Miyasaka

Automatic grapheme-to-phoneme conversion of dutch names
Emmy M. Konst, Louis Boves

Diphone synthesis for the welsh language
Briony Williams

Pause control in Japanese text-to-speech conversion system with lexical discourse grammar
Shinichi Doi, Kazuhiko Iwata, Kazunori Muraki, Yukio Mitome

Generation of prosody in speech synthesis using large speech data-base
Naohiro Sakurai, Takerni Mochida, Tetsunori Kobayashi, Katsuhiko Shirai

Preserving naturalness in synthetic voices while minimizing variation in formant frequencies and bandwidths
Niels-Jorn Dyhr, Marianne Elmlund, Carsten Henriksen

Japanese text-to-speech conversion software for personal computers
Kazuhiro Takahashi, Kazuhiko Iwata, Yukio Mitome, Keiko Nagano

Automatic labeling of speech synthesis corpora
Annemie Vorstermans, Jean-Pierre Martens

On synthesis units for Japanese text-to-speech synthesis
Yasushi Ishikawa, Kunio Nakajima

Inducing concatenative units from machine readable dictionaries and corpora for speech synthesis
Judith L. Klavans, Evelyne Tzoukermann

Structure and representation of an inventory for German speech synthesis
Thomas Portele, Florian Höfer, Wolfgang J. Hess

Towards a prosodic cues-based modelling of phonological variability for text-to-speech synthesis
Anne Lacheret-Dujour, Vincent Pean

Rule-based vs neural network-based approaches to letter-to-phone conversion for portuguese common and proper names
Isabel Trancoso, M. Ceu Viana, Fernando M. Silva, Goncalo C. Marques, Luis C. Oliveira

A corpus-based Mandarin text-to-speech synthesizer
Benjamin Ao, Chilin Shih, Richard Sproat

Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis
Kazuo Hakoda, Tomohisa Hirokawa, Kenzo Itoh

A new system for text-to-speech conversion, and its application to Swedish
Mats Ljungqvist, Anders Lindstrom, Kjell Gustafson

A novel segment-concatenation algorithm for a cepstrum-based synthesizer
Yoshinori Shiga, Yoshiyuki Hara, Tsuneo Nitta

Naturalness and intelligibility of rule-synthesized speech, supplied with specific spectro-temporal features derived from natural continuous speech
Florien J. Koopmans-van Beinum, Louis C. W. Pols



Language Modeling for Speech Recognition


Stochastic context-free language modeling with evolutional grammars
Michael K. Brown, Stephen C. Glinski

A lightweight parser for speech understanding
Nigel Ward

Dynamic probabilistic grammar for spoken language disambiguation
Takeshi Kawabata

Speaker-consistent parsing for speaker-independent continuous speech recognition
Kouichi Yamaguchi, Harald Singer, Shoichi Matsunaga, Shigeki Sagayama

A stochastic morphological analyzer for spontaneously spoken languages
Masaaki Nagata

Automatic adaptive understanding of spoken language by cooperation of syntactic parsing and semantic priming
Jean-Yves Antoine, Jean Caelen, Bertand Caillaud

A maximum entropy model for parsing
Adwait Ratnaparkhi, Salim Roukos, R. Todd Ward

Sentence spotting using continuous structuring method
Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka

Continuous speech recognition using a dialog-conditioned stochastic language model
Hiroyuki Sakamoto, Shoichi Matsunaga

Keyword and phrase spotting with heuristic language model
Tatsuya Kawahara, Toshihiko Munetsugu, Norihide Kitaoka, Shuji Doshita

A spontaneous speech recognition algorithm using word trigram models and filled-pause procedure
Jin'ichi Murakami, Shoichi Matsunaga

Active/non-active word control using garbage model, unknown word re-evaluation in speech conversation
Masayuki Yamada, Yasuhiro Komori, Yasunori Ohora

Error-responsive modifications to speech recognizers: negative n-grams
L. Chase, R. Rosenfeld, Wayne Ward

Towards better language models for spontaneous speech
B. Suhm, Alex Waibel

Empirical acquisition of language models for speech recognition
Michael K. McCandless, James R. Glass

Prediction of prosodic phrase boundaries using stochastic context-free grammar
Shigeru Fujio, Yoshinori Sagisaka, Norio Higuchi

Language models for spontaneous speech recognition: a bootstrap method for learning phrase digrams
Egidio Giachin, Paolo Baggia, Giorgio Micca

Inferring linguistic structure in spoken language
M. Woszczyna, Alex Waibel

Back-off smoothing in a syntactic approach to language modelling
G. Bordel, I. Torrest, Enrique Vidal

Computer assisted grammar construction
H.-H. Shih, Steve J. Young

Language model estimations and representations for real-time continuous speech recognition
Giuliano Antoniol, Fabio Brugnara, Mauro Cettolo, Marcello Federico

Sub-dictionary statistical modeling for isolated word recognition
Bruno Jacob, Regine André-Obrecht

A class bigram model for very large corpus
Michele Jardino


Models and Systems for Spoken Dialogue


A spoken dialogue system based on hierarchical feedback mechanism
Akio Amano, Toshiyuki Odaka

A dedicated task-oriented dialogue theory in support of spoken language dialogue systems design
Niels Ole Bernsen, Laila Dybkjaer, Hans Dybkjaer

Interactive speech dialogue system using simultaneous understanding
Farzad Ehsani, Kaichiro Hatazaki, Jun Noguchi, Takao Watanabe

A cooperative man-machine dialogue model for problem solving
Masahiro Araki, Taro Watanabe, Felix Merida Quimbo, Shuji Doshita

A multi-modal dialogue system for telephone directory assistance
Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano

Automated query identification in English dialogue
Mark Terry, Randall Sparks, Patrick Obenchain

Robust discourse processing considering misrecognition in spoken dialogue system
Keiichi Sakai, Yuji Ikeda, Minoru Fujita

Analysis of multimodal interaction data in human communication
Keiko Watanuki, Kenji Sakamoto, Fumio Togawa

Changes in user's responses with use of a speech dialog system
Kazuhiro Arai

Collecting and analyzing nonverbal elements for maintenance of dialog using a wizard of oz simulation
Itou Katunobu, Akiba Tomoyosi, Hasegawa Osamu, Hayamizu Satoru, Tanaka Kazuyo

Porting the bilingual voyager system to Italian
Giovanni Flammia, James R. Glass, Michael Phillips, Joseph Polifroni, Stephanie Seneff, Victor W. Zue

Similarity-based identification of repairs in Japanese spoken language
Gen-Ichiro Kikui, Tsuyoshi Morimoto

Rapid prototyping of a dialogue system using a generic dialogue development platform
Lars Bo Larsen, Anders Baekgaard

Heuristics for generating acoustic stress in dialogues and examination of their validity
Shozo Naito, Akira Shimazu

Application and dialogue in the sundial system
Jacques Siroux, Mouloud Kharoune, Marc Guyomard

A dialog analysis using information of the previous sentence
Shin-ichiro Kamei, Shin-ichi Doi, Takako Komatsu, Susumu Akamine, Hitoshi Iida, Kazunori Muraki

Recognizing plans in more natural dialogue utterances
Kiyoshi Kogure, Akira Shimazu, Mikio Nakano

Understanding of time constituents in spoken language dialogues
Bernd Hildebrandt, Gernot A. Fink, Franz Kummert, Gerhard Sagerer

An analysis of Japanese sentences in spoken dialogue and its application to communicative intention recognition
Tadahiko Kumamoto, Akira Ito, Tsuyoshi Ebina

Extra propositional focus and belief revision
Beth Ann Hockey

Frames, a unified model for the representation of reference and space in a man-machine dialogue
Daniel Schang, Laurent Romary

Roles of interjectory utterances in spoken discourse
Masahito Kawamori, Akira Shimazu, Kiyoshi Kogure

Communicative mode dependent contribution from the recipient in information providing dialogue
Yukiko Ishikawa

Strategies for oral dialogue control
Alain Cozannet, Jacques Siroux

Robust speech understanding
Astrid Brietzmann, Fritz Class, Ute Ehrlich, Paul Heisterkamp, Alfred Kaltenmeier, Klaus Mecklenburg, Peter Regel-Brietzmann

Dialog context dependencies of utterances generated from concept reperesentation
Yoichi Yamashita, Keiichi Tajima, Yasuo Nomura, Riichiro Mizoguchi

Effects on utterances caused by knowledge on the hearer
Shu Nakazato, Katsuhiko Shirai

An efficient data-driven model for cooperative spoken dialogue
A. Ferrieux, M. D. Sadek

Multilingual language generation across multiple domains
James R. Glass, Joseph Polifroni, Stephanie Seneff


Speech Recognition in Adverse Environments


Compensation of telephone line effects for robust speech recognition
C. Mokbel, R. Paches-Leal, D. Jouvet, J. Monné

Telephone line characteristic adaptation using vector field smoothing technique
Jun-ichi Takahashi, Shigeki Sagayama

A study of speech recognition system robustness to microphone variations: experiments in phonetic classification
Jane Chang, Victor W. Zue

Isolated word recognition using models for acoustic phonetic variability by lombard effect
Tadashi Suzuki, Kunio Nakqjima, Yoshiharu Abe

A source generator based production model for environmental robustness in speech recognition
John H. L. Hansen, Brian D. Womack, Levent M. Arslan

A frequency-weighted continuous density HMM for noisy speech recognition
Hiroshi Matsumoto, Hiroyuki Imose

A study on adaptations of cepstral and delta cepstral coefficients for noisy speech recognition
Lee-Min Lee, Hsiao-Chuan Wang

A comparative study of feature representations for robust speech recognition in adverse environments
Kuldip K. Paliwal, Bishnu S. Atal

ARDOSS: autoregressive domain spectral subtraction for robust speech recognition in additive noise
Hugo Van Hamme

Speech recognition with rapid environment adaptation by spectrum equalization
Keizaburo Takagi, Hiroaki Hattori, Takao Watanabe

Signal processing for robust speech recognition
Richard M. Stern, Fu-Hua Liu, Pedro J. Moreno, Alejandro Acero

A comparison of three noisy speech recognition approaches
Olivier Siohan, Yifan Gong, Jean-Paul Haton



Prosody of Discourse and Dialogue


Intonation pattern with focus and related muscle activities in tokyo dialect
Shigeru Kiritani, Kikuo Maekawa, Hajime Hirose

The effects of contrastive accent and lexical stress upon temporal distribution in a sentence
Jianfen Cao

Speech rate and syllable timing in spontaneous speech
Henrietta J. Cedergren, Hélène Perreault

An experimental phonetic study of speech rhythm in standard Korean
Hyun-bok Lee, Narn-taek Jin, Cheol-jae Seong, Il-jin Jung, Seung-mie Lee

A rhythm theory for spontaneous speech: the role of vowel amplitude in the rhythmic hierarchy
Noriko Umeda, Toby Wedmore

Modelling Swedish prosody in a dialogue framework
Gösta Bruce, Björn Granström, Kjell Gustafson, David House, Paul Touati

Prosodic characteristics of a spoken dialogue for information query
Hiroya Fujisaki, Sumio Ohno, Masafumi Osame, Mayumi Sakata, Keikichi Hirose

Analysis of prosodic and linguistic features of spontaneous Japanese conversational speech
Shoichi Takeda, Yoshiyuki Itoh, Norifumi Sakuma, Kei Yokosato

Combining the use of duration and F0 in an automatic analysis of dialogue prosody
Nick Campbell

Improving parsing by incorporating 'prosodic clause boundaries into a grammar
G. Bakenecker, U. Block, Anton Batliner, Ralf Kompe, Elmar Nöth, Peter Regel-Brietzmann

A prosodic recognition module based on linear discriminant analysis
Andrew Hunt

Use of prosodic features in the recognition of continuous speech
Keikichi Hirose, Atsuhiro Sakurai, Hiroyuki Konno


Spoken Language Cognition and Its Disorders


The inconsistency of consistency effects in reading: the case of Japanese kanji phonology
Taeko Nakayama Wydell, Brian Butterworth

An acoustic analysis of unreleased stop consonants in word-final position
Valter Ciocca, Livia Wong, Lydia K. H. So

Speech segmentation in dutch: no role for the syllable
Jean Vroomen, Beatrice de Gelder

Do ambiguous fricatives rhyme? lexical involvement in phonetic decision-making depends on task demands
James M. McQueen

Moraic segmentation in Japanese revisited
P. A. Halle, Juan Segui

Prosodic information and processing of temporarily ambiguous constructions in Japanese
Jennifer J. Venditti, Hiroko Yamashita

Role of prosodic features in the human process of speech perception
Nobuaki Minematsu, Keikichi Hirose

Limitations of lip-reading advantage by desynchronizing visual and auditory information in speech
Masahiro Hashimoto, Hideaki Seki

Word meaning deafness: effects of word type
Sue Franklin, Judy Turner, Julie Morris

Concept and grammar acquisition based on combining with visual and auditory information
Mikio Masukata, Seiichi Nakagawa

The punch and judy man: a study of phonological / phonetic variation
Gavin J. Dempster, Sheila M. Williams, Sandra P. Whiteside

The auditory perception of children's age and sex
Hartmut Traunmüller, Renée van Bezooijen

Are representations used for talker identification available for talker normalization?
James S. Magnuson, Reiko A. Yamada, Howard C. Nusbaum

Non-physiological differences between male and female speech: evidence from the delayed F0 fall phenomenon in Japanese
Yoko Hasegawa, Kazue Hata

Speaker individualities in speech spectral envelopes
Tatsuya Kitamura, Masato Akagi

Prosodic imitation: productional results
Duncan Markham

Articulatory description of affricate production in speech disordered children using electropalatography (EPG)
Fiona Gibbon, William J. Hardcastle

A phonetic and phonological analysis of stuttering in Japanese
Akira Ujihira, Haruo Kubozono

Perception, production and training of new consonant contrasts in children with articulation disorders
Donald G. Jamieson, Susan Rvachew

Cognitive processes of speech sounds in a brain-damaged patient
Sachiko Nakakoshi, Atsushi Mizobuchi, Hiroto Katori

A cross-linguistic study of lateral /s/ using electropalatography (EPG)
N. Suzuki, H. Dent, M. Wakumoto, Fiona Gibbon, K. Michi, William J. Hardcastle

Prosody of recurrent utterances in aphasic patients
Junko Matsubara, Toshihiro Kashiwagi, Morio Kohno, Hirotaka Tanabe, Asako Kashiwagi

Intonation and language teaching
Virginia LoCastro

A computer-aided phonetic instruction system for south-asian languages
Tsuyoshi Nara, P. Bhaskararao

Rhythm processing by a patient with pure anarthria: some suggestions on the role of rhythm in spoken language processing
Morio Kohno, Junko Matsubara, Katsuko Higuchi, Toshihiro Kashiwagi

Japanese accentuation of foreign learners and its interlanguage
Nobuko Yamada

Mechanisms producing recurring utterances in a patient with slowly progressive aphasia
Masato Kaneko

Hypermedia for spoken language education
Kiyokata Katoh, Takako Ayusawa, Yukihiro Nishinuma, Richard Harrison, Kikuko Yamashita

A text-to-speech system for application by visually handicapped and illiterate
P. Bhaskararao, Venkata N. Peri, Dr. Vishwas Udpikar


Spoken Language Systems and Assessments


Talker localization and speech recognition using a microphone array and a cross-powerspectrum phase analysis
Diego Giuliani, Maurizio Omologo, P. Svaizer

System of microphone arrays and neural networks for robust speech recognition in multimedia environments
Qiguang Lin, Ea-Ee Jan, Chi Wei Che, Bert de Vries

Estimating performance of pipelined spoken language translation systems
Manny Rayner, David Carter, Patti Price, Bertil Lyberg

Generation of multi-syllable nonsense words for the assessment of Korean text-to-speech system
Cheol-Woo Jo, Kyung-Tae Kim, Yong-Ju Lee

Voice map: a dialogue-based spoken language information access system
Aruna Bayya, Michael Durian, Lori Meiskey, Rebecca Root, Randall Sparks, Mark Terry

Development of a document preparation system with speech command using EDR electronic dictionaries
Shigenobu Seto, Kazuhiro Kimura

Radiological reporting by speech recognition: the a.re.s. system
Bianca Angelini, Giuliano Antoniol, Fabio Brugnara, Mauro Cettolo, Marcello Federico, Roberto Fiutem, Gianni Lazzari

A spoken language system for information retrieval
S. K. Bennacef, H. Bonneau-Maynard, Jean-Luc Gauvain, Lori F. Lamel, Wolfgang Minker

Recogniser response modelling from testing on series of minimal word pairs
Borge Lindberg

A study on the problems for apllication of voice interface based on ford recognition
Toshimitsu Minowa, Yasuhiko Arai, Hisanori Kanasashi, Tatsuya Kimura, Takuji Kawamoto

A UI design support tool for multimodal spoken dialogue system
Hiroyuki Kamio, Mika Koorita, Hiroshi Matsu'ura, Masafumi Tamura, Tsuneo Nitta

Multimodal drawing tool using speech, mouse and key-board
Takuya Nishirnoto, Nobutoshi Shida, Tetsunori Kobayashi, Katsuhiko Shirai

Generation of non-entry words from entries of the natural speech database
Yasuhiko Arai, Toshimitsu Minowa, Hiroko Yoshida, Hirofmi Nishimura, Hiroyvki Kamata, Takashi Honda

MECALLSAT: a multimedia environment for computer-aided language learning incorporating speech assessment techniques
Pedro Gomez, Daniel Martinez, Victor Nieto, Victoria Rodellar

Improving recognizer acceptance through robust, natural speech repair
Arthur E. McNair, Alex Waibel

User acceptance of automatic speech recognition in telephone services
David Fay

Identifying salient usability attributes for automated telephone services
Stephen Love, R. T. Dutton, J. C. Foster, Mervyn A. Jack, F. W. M. Stentiford

Word complexity measures in the context of speech intelligibility tests
Arnd Mariniak

Recognition accuracy methods and measures
Frank H. Wu, Monica A. Maries

A feature-profile for application-specific speech synthesis assessment and evaluation
Ute Jekosch, Louis C. W. Pols

A description model for speech assessment tests with subjects
Thomas Hegehofer

VLSI implementation of a robust hybrid parameter-extractor and neural network for speech decoding
Victoria Rodellar, Antonio Diaz, Jose Gallardo, Virginia Peinado, Victor Nieto, Pedro Gomez

An objective measure for qualitatively assessing low-bit-rate coded speech
Toshiro Watanabe, Shinji Hayashi

Performance comparison of recognition systems based on the akaike information criterion
Kazuhiko Ozeki

Robust speech recognition in the automobile
Nobutoshi Hanai, Richard M. Stern

On the development of a dictation machine for Spanish: DIVO
Javier Macias-Guarasa, Manuel A. Leandro, Jose Colas, Alvaro Villegas, Santiago Aguilera, Jose M. Pardo

Environmental robustness in automatic speech recognition using physiologic ally-motivated signal processing
Yoshiaki Ohshima, Richard M. Stern


Large Vocabulary/Speaker Independent Speech


Recognition ********* a dynamic network decoder design for large vocabulary speech recognition
V. Valtchev, J. J. Odell, Phil C. Woodland, Steve J. Young

A word graph algorithm for large vocabulary, continuous speech recognition
Hermann Ney, Xavier Aubert

Fast match for segment-based large vocabulary continuous speech recognition
Michael Phillips, David Goddeau

Multiple-pronunciation lexical modeling in a speaker independent speech understanding system
Chuck Wooters, Andreas Stolcke

MMIE training for large vocabulary continuous speech recognition
Yves Normandin, Roxane Lacouture, Regis Cardin

An intelligent and efficient word-class-based Chinese language model for Mandarin speech recognition with very large vocabulary
Yen-Ju Yang, Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee

Tree-structured speaker clustering for speaker-independent continuous speech recognition
Tetsuo Kosaka, Shoichi Matsunaga, Shigeki Sagayama

Compact-size speaker independent speech recognizer for large vocabulary using "compats" method
Tatsuya Kimura, Hiroyasu Kuwano, Akira Ishida, Taisuke Watanabe, Shoji Hiraoka

A keyword-spotting unit for speaker-independent spontaneous speech recognition
Yasuyuki Masai, Jun'Ichi Iwasaki, Shin'Ichi Tanaka, Tsuneo Nitta, Masahiro Yao, Tomohiro Onogi, Akira Nakayama

KT-stock: a speaker-independent large-vocabulary speech recognition system over the telephone
Myoung-Wan Koo, Sang-Kyu Park, Kyung-Tae Kong, Sam-joo Doh

Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus
Bianca Angelini, Fabio Brugnara, D. Falavigna, Diego Giuliani, Roberto Gretter, Maurizio Omologo




Neural Network and Connectionist Approaches


Minimum-error-rate training of predictive neural network models
KyungMin Na, JaeYeol Rheem, SouGuil Ann

Spoken language acquisition for automated call routing
Allen L. Gorin, H. Hanek, R. C. Rose, L. Miller

A speech recognition system using both auditory and afferent pathway signal processing
Eliathamby Ambikairajah, Owen Friel, William Millar

Using gamma filters to model temporal dependencies in speech
Steve J. Renals, Mike M. Hochberg

Phone recognition using a transition-controlled, segment-based dp/mlp hybrid
Jan Verhasselt, Jean-Pierre Martens

Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system
Mike M. Hochberg, Steve J. Renals, A. J. Robinson, D. J. Kershaw

A multi-state NN/HMM hybrid method for high performance speech recognition
Dong Yu, Taiyi Huang, Dao Wen Chen

A continuous HMM based preprocessor for modular speech recognition neural networks
Fikret S. Gurgen, J. M. Song, R. W. King

System integrating connectionist and ibolic approaches for spoken language understanding
Ying Cheng, Paul Fortier, Yves Normandin

Recent work in hybrid neural networks and HMM systems in CSR tasks
Xavier Menendez-Pidal, Javier Ferreiros, Ricardo de Cordoba, Josi M. Pardo

Hidden Markov models and selectively trained neural networks for connected confusable word recognition
Jean-Frangois Mari, Dominique Fohr, Yolande Anglade, Jean-Claude Junqua

Modeling dynamics in connectionist speech recognition - the time index model
Yochai Konig, Nelson Morgan

Mandarin syllables recognition by subsyllables dynamic neural network
Dao Wen Chen, Xiao Dong Li, San Zhu, Dongxin Xu, Taiyi Huang

Evaluation of phonetic feature recognition with a time-delay neural network
Shigeki Okawa, Christoph Windheuser, Frédéric Bimbot, Katsuhiko Shirai

A self organizing feature map based on the fisher discriminant
Enric Monte, Javier Hernando

Using wavelet dyadic grids and neural networks for speech recognition
Richard R. Favero, Fikret S. Gurgen

A normalization method of prediction error for neural networks
Hiroaki Hattori

Recurrent neural network word models for small vocabulary speech recognition
Philippe Le Cerf, Dirk Van Compernolle

A novel fuzzy partition model architecture for classifying dynamic patterns
Yoshinaga Koto, Shigeru Katagiri

Handling missing data in speech recognition
Martin Cooke, Phil Green, Malcolm Crawford

A new probabilistic framework for connectionist time alignment
Patrick Haffner

A speech recognition model using internal degrees of freedom
Ken-ichi Iso

Adaptation of neural network model: comparison of multilayer perceptron and LVQ
Dongxin Xu, Dao Wen Chen, Qian Ma, Bo Xu, Taiyi Huang

Simplified sub-neural-networks for accurate phoneme recognition
Takuya Koizumi, Shuji Taniguchi, Ken-ichi Hattori, Mikio Mori

A neural network for phonetically decoding the speech trace
Victoria Rodellar, Victor Nieto, Pedro Gomez, Daniel Martinez, M. Mercedes Perez

Noise robust speech recognition using a dynamic-cepstrum
Kiyoaki Aikawa, Tsuyoshi Saito


Speech Analysis and Enhancement


Telephone-band speech enhancement based on the fundamental frequency component compensation
Toshiyuki Aritsuka, Yoshito Nejime

Reduction of noise level by SPAD (speech processing system by use of auto-difference function)
Nobuyuki Kunieda, Tetsuya Shimamura, Jouji Suzuki, Hiroyuki Yashima

An algorithm to reconstruct wideband speech from narrowband speech based on codebook mapping
Yuki Yoshida, Masanobu Abe

An hmm-based cepstral-domain speech enhancement system
C. W. Seymour, M. Niranjan

Voice adaptation using multi-functional transformation with weighting by radial basis function networks
Naoto Iwahashi, Yoshinori Sagisaka

A dynamic-window weighted-RMS averaging filter applied to speaker identification
Hong Tang, Xiaoyuan Zhu, Iain Macleod, Bruce Millar, Michael Wagner

Quality enhancement of band limited speech by filtering and multirate techniques
Hiroshi Yasukawa

Characteristics of multi-layer perceptron models in enhancing degraded speech
T. T. Le, J. S. Mason, T. Kitamura

A time-frequency analysis technique for speech recognition signal processing
Adam B. Fineberg, Kevin C. Yu

Estimation of the glottal pulseform based on discrete all-pole modeling
Paavo Alku, Erkki Vilkman

Analysis and detection of double talk in telephone dialogs
H. Nishi, M. Kitai

A self-learning approach to transcription of danish proper names
Ove Andersen, Paul Dalsgaard

A time-varying analysis based on analytic speech signals
Eisuke Horita, Yoshikazu Miyanaga, Koji Tochinai

New spectrum interpolation method for improving quality of synthesized speech
Takashi Endo, Shun'ichi Yajima

Automatic context-sensitive measurement of the acoustic correlates of distinctive features at landmarks
Mark Johnson

A comparison of different acoustic and articulatory representations for the determination of place of articulation of plosives
Alain Soquet, Marco Saerens

An analysis of voice quality using sinusoidal model
Naotoshi Osaka

Fast formant estimation of children's speech
Alan A. Wrench, M. M. Watson, D. S. Soutar, A. G. Robertson, John Laver

Some fast higher order AR estimation techniques applied to parametric wiener filtering
Josep Salavedra, Enrique Masgrau, Asuncidn Moreno, Joan Estarellas, Javier Hernando

Applications of a rule-based speech synthesizer module
Mikio Yamaguchi, Shigeharu Toyoda, Katsuhiro Yada

Quasi-articulatory formant synthesis
Jon Iles, William Edmondson

On the connection between manual segmentation conventions and "errors" made by automatic segmentation
Knut Kvale

Natural utterance segmentation and discourse label assignment
Mutsuko Tomokiyo

Possibility of speech synthesis by common voice source
Satoshi Yumoto, Jouji Suzuki, Tetsuya Shimamura

A scheme for Chinese speech synthesis by rule based on pitch-synchronous multi-pulse excitation LP method
Changfu Wang, Wenshen Yue, Keikichi Hirose, Hiroya Fujisaki

Text processing within a speech synthesis system
Anders Lindstrom, Mats Ljungqvist

E-mail to voice-mail conversion using a portuguese text-to-speech system
P. Carvalho, P. Lopes, Isabel Trancoso, Luis C. Oliveira

Tempo estimation by wave envelope for recognition of paralinguistic features in spontaneous speech
Shigeyoshi Kitazawa, Satoshi Kobayashi, Takao Matsunaga, Hideya Ichikawa




Speech/Language Database


A speech and language database for speech translation research
Tsuyoshi Morimoto, Noriyoshi Uratani, Toshiyuki Takezawa, Osamu Furuse, Yasuhiro Sobashima, Hitoshi Iida, Atsushi Nakamura, Yoshinori Sagisaka, Norio Higuchi, Yasuhiro Yamazaki

The translanguage English database (TED)
Lori F. Lamel, Florian Schiel, Adrian Fourcin, Joseph Mariani, Hans G. Tillmann

The data collection of voice across Japan (VAJ) project
Ikuo Kudo, Takao Nakama, Nozomi Arai, Nahoko Fujimura

Creation and analysis of the dutch polyphone corpus
M. Damhuis, T. Boogaart, C. in't Veld, M. Versteijlen, W. Schelvis, L. Bos, Louis Boves

The design and efficient recording of a 3000 speaker scandinavian telephone speech database: rafael.0
Per Rosenbeck, Bo Baungaard, Claus Jacobsen, Dan-Joe Barry

The VESTEL telephone speech database
D. Tapias, Alejandro Acero, J. Esteve, J. C. Torrecilla

Telephone speech corpus development at CSLU1
Ronald Cole, Mark Fanty, Mike Noel, Terri Lander

Derivation of a large speech and natural language database through alignment of court recordings an their transcripts
P. E. Kenne, Hamish Pearcy, Mary O'Kane

Description of the caip speech corpus
Qiguang Lin, ChiWei Che, Joe French

Automating the design of compact linguistic corpora
Rob Kassel

Annotating illocutionary force types and phonological features into a spontaneous dialogue corpus: an experimental study
Kazuyo Tanaka, Kanae Kinebuchi, Naoko Houra, Kazuyuki Takagi, Shuichi Itahashi, Katsunobu Itou, Satoru Hayamizu


Speaker, Language and Phoneme Recognition


Cepstral channel normalization techniques for HMM-based speaker verification
Aaron E. Rosenberg, Chin-Hui Lee, Frank K. Soong

Noise reduction for speech recognition and speaker verification in mobile telephony
Vijay Raman, Jayant M. Naik

Discriminative phonemes for speaker identification
Eluned S. Parris, Michael J. Carey

Speaker identification in noisy conditions using linear prediction of the one-sided autocorrelation sequence
Javier Hernando, Climent Nadeu, Carlos Villagrasa, Enric Monte

A text-independent speaker identification system based on neural networks
Jialong He, Li Liu, Günther Palm

Hybrid threshold approach in text-independent speaker verification
Fangxin Chen, Bruce Millar, Michael Wagner

Speaker recognition based on subspace methods
Y. Ariki, K. Doi

Performance improvement of speaker recognition system for small training data
Seong Jin Yun, Yung Hwan Oh

A speaker verification system using prosodic features
B. Yegnanarayana, S. P. Wagh, S. Rajendran

Statistical trajectory models for phonetic recognition
William D. Goldenthal, James R. Glass

A common phone model representation for speech recognition and synthesis
Mats Blomberg

Spontaneous speech language identification with a knowledge of linguistics
Shubha Kadambe, James L. Hieronymus

Recent improvements in an approach to segment-based automatic language identification
Timothy J. Hazen, Victor W. Zue

Language identification with embedded word models
Padma Ramesh, David B. Roe

Language identification of six languages based on a common set of broad phonemes
Kay M. Berkling, Etienne Barnard

Three language identification methods based on HMMs
Allan A. Reyes, Takashi Seino, Seiichi Nakagawa

Spoken language discrimination using speech fundamental frequency
Shuichi Itahashi, Jian Xiong Zhou, Kimihito Tanaka

Application of inter-language phoneme similarities for language identification
Paul Dalsgaard, Ove Andersen

Comparison of acoustic features and robustness tests of a real-time recogniser using a hardware telephone line simulator
Hugo Van Hamme, Guido Gallopyn, Ludwig Weynants, Bart D'hoore, Hervé Bourlard

Phoneme recognition in various styles of utterance based on mutual information criterion
Shigeki Okawa, Tetsunori Kobayashi, Katsuhiko Shirai

Speaker independent speech recognition method using phoneme similarity vector
Masakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada

Phoneme recognition using acoustic events
Kai Hübener, Julie Carson-Berndsen

Contributions of selected spectral regions to vowel classification accuracy
Parham Mokhtari, Frantz Clermont

Filtering of spectral parameters for speech recognition
Climent Nadeu, Biing-Hwang Juang

Pitch-based emphasis detection for segmenting speech recordings
Barry Arons

Overlapping phone segments
Z. Li, P. Kenny

Clustering triphones by phonological mapping
Maurice K. Wong


Speech Perception and Speech Related Disorders


Stochastic perceptual auditory-event-based models for speech recognition
Nelson Morgan, Herve Bourlard, Steven Greenberg, Hynek Hermansky

Auditory perception of filled and empty time intervals, and mechanism of time discrimination
Itaru E. Tatsumi, Hiroya Fujisaki

Speech perception and growth of masking in younger and older adults
Margaret F. Cheesman, Jennifer C. Armitage, Kimberley Marshall

A theory of asymmetric intensity enhancement around acoustic transients
Irino Toshio, Roy D. Patterson

Text-to-speech in the speech training of the deaf: adapting models to individual speakers
Hector Javkin, Elizabeth Keate, Norma Antonanzas, Ranjun Zou, Karen Youdelman

Robust pitch and voicing detection using a model of auditory signal processing
Thomas Holton

Listener adaptive characteristics in dialogue speech effects of temporal adjustment on emotional aspects of speech
Satoshi Imaizumi, Akiko Hayashi, Toshisada Deguchi

Effects of acoustic discontinuity and phonemic deviation on the apparent duration of speech segments
Minoru Tsuzaki, Hiroaki Kato, Masako Tanaka

The influence of context on spoken language perception and processing among elderly and hearing impaired listeners
Chie H. Craig, Richard. M. Warren, Tricia B. K. Chirillo

Acceptability of temporal modification in consonant and vowel onsets
Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka

An integrated acoustic evaluation system of pathologic voice
Weizhong Zhu, Yoshinobu Kikuchi, Yasuo Endo, Hideki Kasuya, Minoru Hirano, Masanao Ohashi

An electronic dictionary of Japanese sign language: design of system and organization of database
Yumiko Fukuda, Wako Ikehara, Erniko Kamikubo, Shizuo Hiki

Synthesis of pathological voice based on a stochastic voice source model
Yasuo Endo, Hideki Kasuya

Hearing aid evaluation using variable - speech - rate audiometry
Hiroshi Hosoi, Yoshiaki Tsuta, Takashi Nishida, Kiyotaka Murata, Fumihiko Ohta, Tsuyoshi Mekata, Yumiko Kato

Relationship between acoustic measures of vocal perturbation and perceptual judgments of breathiness, harshness, and hoarseness
Fred D. Minifie, Daniel Z. Huang, Jordan Green

A hearing aid by single resonant analysis for telephonic speech
Takashi Ikeda, Kouji Tasaki, Akira Watanabe

Perceptual learning of Japanese mora syllables by native speakers of american English an analysis of acquisition processes of speech perception in second language learning
Tsuneo Yamada, Reiko A. Yamada, Winifred Strange

A DSP-based amplitude compressor for digital hearing AIDS
Yuichi Veda, Takayuki Agawa, Akira Watanabe

Perception and production saturation of spoken English as a first phase in reducing a foreign accent
Amalia Sarabasa

Harp: an autonomous speech rehabilitation system for hearing-impaired people
Edmund Rooney, Fabrizio Carraro, Will Dempsey, Katie Robertson, Rebecca Vaughan, Mervyn A. Jack, Jonathan Murray

The intelligibility of Japanese speakers' production of american English /r/, /i/, and /w/, as evaluated by native speakers of american English
Reiko A. Yamada, Winifred Strange, James S. Magnuson, John S. Pruitt, William D. Clarke

Phonetic visualization for speech training system by using neural network
Itaru Nagayama, Norio Akamatsu, Toshiki Yoshino

Perceptual and productive distinction between the English [r] and [l] in prevocalic position by English and Japanese speakers
Elzbieta B. Slawinski

Cortical activation with speech in cochlear implant users: a study with positron emission tomography
Yasushi Naito, Hidehiko Okazawa, Iwao Honjo, Yosaku Shiomi, Haruo Takahashi, Waka Hoji, Michio Kawano, Hiroshi Ishizu, Sadahiko Nishizawa, Yoshiharu Yonekura, Junji Konishi

Comparative study of spectral representations in measuring the English /r/-/l/ acoustic-perceptual dissimilarity
Kiyoaki Aikawa, Reiko A. Yamada

Acoustic simulation of auditory model based speech processor for cochlear implant system
Shigeyoshi Kitazawa, Kazuyuki Muramoto, Juichi Ito

The influence of knowledge and experience during the processing of spoken words: non-native listeners
Makio Kashino, Chie H. Craig

Perception and production of mood in speech by cochlear implant users
David House

A portable digital speech rate converter and its evaluation by hearing-impaired listeners
Yoshito Nejime, Toshiyuki Aritsuka, Toshiki Imamura, Tohru Ifukube, Jun'ichi Matsushima




Continuous Speech Recognition


Continuous speech dictation in French
Jean-Luc Gauvain, Lori F. Lamel, Gilles Adda, Martine Adda-Decker

Labeler agreement in phonetic labeling of continuous speech
Ronald Cole, Beatrice T. Oshika, Mike Noel, Terri Lander, Mark Fanty

Recent technology developments in connected digit speech recognition
Biing-Hwang Juang, Jay G. Wilpon

The berkeley restaurant project
Daniel Jurafsky, Chuck Wooters, Gary Tajchman, Jonathan Segal, Andreas Stolcke, Eric Fosler, Nelson Morgan

Improvements in beam search
Volker Steinbiss, Bach-Hiep Tran, Hermann Ney

Data-based control of the search space generated by multiple knowledge bases for speech recognition
Kevin Johnson, Roberto Garigliano, Russell J. Collingham

Evaluation of unknown word processing in a spoken word recognition system
Atsuhiko Kai, Seiichi Nakagawa

Using accent information to correctly select Japanese phrases made of strings of syllables
Tetsuo Araki, Satoru Ikehara, Hideto Yokokawa

Estimating recognition confidence: methods for conjoining acoustics, semantics, pragmatics and discourse
Sheryl R. Young

Issues in topic identification on the switchboard corpus
John McDonough, Herbert Gish

Automatic speech recognition using dynamically defined speech units
L. Deng, H. Sameti

Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser
M. Jones, Phil C. Woodland

Continuous speech understanding based on automatic learning of acoustic and semantic models
Natividad Prieto, Emilio Sanchis, Luis Palmero

On inter-phrase context dependencies in continuously read Japanese speech
Kazuhiro Kondo, Yu-Hung Kao, Barbara Wheatley

A close high-level interaction scheme for recognition and interpretation of speech
Gernot A. Fink, Franz Kummert, Gerhard Sagerer

Interaction between most reliable acoustic cues and lexical analysis
Sylvie Coste-Marquis

Simultaneous spotting of phonemes and words in continuous speech
Y. Ariki, T. Kawamura

Predicting word spotting performance
Man-hung Siu, Herbert Gish, Robin Rohlicek

A method for word spotting in continuous speech using both segmental and contextual likelihood scores
Sumio Ohno, Hiroya Fujisaki, Keikichi Hirose

Phone-based prefiltering for continuous speech recognition
Renato De Mori, Diego Giuliani, Roberto Gretter

Speech recognition without grammar or vocabulary constraints
Harold Singer, Jun-ichi Takami

Comparison of three approaches to phonetic string generation for large vocabulary speech recognition
Javier Macias-Guarasa, Manuel A. Leandro, Xavier Menendez-Pidal, Jose Colas, Ascension Gallardo, Jose M. Pardo, Santiago Aguilera

Automatic generation of words toward flexible vocabulary isolated word recognition
Pietro Laface, Lorenzo Fissore, F. Ravera

Fast speaker adaptation through spectral transformation for continuous speech recognition
H. C. Choi, R. W. King

Dynamic machine adaptation in a multi-speaker isolated word recognition system
Sekharjit Datta

Discourse structure for spontaneous spoken interactions: multi-speaker vs. human-computer dialogs
Sheryl R. Young

Analysis of voice fundamental frequency contours of German utterances using a quantitative model
Hansjörg Mixdorff, Hiroya Fujisaki


×

Plenary Lectures

Integration of Speech and Natural Language Processing

Articulatory Motion

Cognitive Models for Spoken Language Processing

Semantic Interpretation of Spoken Messages

Prosody

Towards Natural Sounding Synthetic Speech

Statistical Methods for Speech Recognition

Phonetics & Phonology I, II

Adaption and Training for Speech Recognition

Science and Technology for Multimodal Interfaces

Measurements and Models of Speech Production

Applications of Spoken Language Processing

Speech Synthesis I, II

New Approach for Brain Function Research in Speech Perception and Production/

Language Modeling for Speech Recognition

Models and Systems for Spoken Dialogue

Speech Recognition in Adverse Environments

Speech Analysis

Prosody of Discourse and Dialogue

Spoken Language Cognition and Its Disorders

Spoken Language Systems and Assessments

Large Vocabulary/Speaker Independent Speech

Perception and Structure of Spoken Language

Voice Quality

Neural Network and Connectionist Approaches

Speech Analysis and Enhancement

Acquisition of Spoken Language

Education of Spoken Language

Speech/Language Database

Speaker, Language and Phoneme Recognition

Speech Perception and Speech Related Disorders

Speech Coding

The Impact of Signal Processing Technologies

Continuous Speech Recognition