4th European Conference on Speech Communication and Technology

Madrid, Spain
18-21 September 1995

Speech Coding I

Vector quantization of glottal pulses
Thomas Eriksson, Jan Linden, Jan Skoglund

A speech coding algorithm based on prototypes interpolation with critical bands and phase coding
Michele Festa, Daniele Sereno

Very low-bitrate speech coding using perceptually-derived spectral data
D. Tsoukalas, Jiannis Mouropoulos, George Kokkinakis

A new very low bit rate speech coder: the step decomposition vocoder
Lorenzo Piazzo

Time envelope LP vocoder: a new coding technique at very low bit rates
I. A. Atkinson, A. M. Kondoz, B. G. Evans

Speech coding based on the discrete-time wavelet transform and human auditory system properties
Dan Stefanoiu, Radwan Kastantin, Gang Feng

Wavelets for low bit rate speech coding applications
F. J. Ancin, M. L. Larreategui, B. L. Burrows, R. A. Carrasco

Adaptive speech vector coding with a multiresolution hierarchical codebook
E. Mandridake, R. Atay, M. Najim

Subband analysis-by-synthesis coding
Andrei Popescu, Nicolas Moreau

A robust 2.4 kb/s LP-MBE with iterative LP modelling
Clifford I. Parris, Danny Wong, Francois Chambon

Improved transient representation and quantization for sinusoidal speech coders
M. S. Torres-Guijarro, F. J. Casajus-Quiros

Efficient multiband excitation linear predictive coding of speech at 1.6 kbps
W. M. E. Yu, Cheung-Fat Chan

Voice coding in the MSBN satellite communication system
Bruno Wery, Stephane Deketelaere

Spectral envelope estimation for low bit-rate sinusoidal speech coders
B. M. G. Cheetham, X. Q. Sun, W. T. K. Wong

Speaker Recognition I-III

On the use of features from prediction residual signals in speaker identification
Jialong He, Li Liu, Günther Palm

Some nonparametric distance measures in speaker verification
Kai Tat Ng, Haizhou Li, Jean-Paul Haton

Adaptive transforms for speaker recognition
Michael J. Carey, Graham D. Tattersall, Eluned S. Parris

Speaker recognition with discriminative speaker VQ models
Kai Tat Ng, Jian Su, Bingzheng Xu

Parametric speaker recognition over large population of telephonic voices
A. Federico, Andrea Paoloni

Speaker recognition experiments in Estonian using multi-layer feed-forward neural nets
Toomas Altosaar, Einar Meister

Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods
Ivan Magrin-Chagnolleau, Jean-François Bonastre, Frédéric Bimbot

Automatic speaker recognition using formants-based nearest-neighbour distance measure
Pavel V. Labulin, Sergey L. Koval, Andrej N. Raev

Discrimination of voices of twins and siblings for speaker verification
M. Mehdi Homayounpour, Gerard Chollet

Theoretical error prediction for a language identification system using optimal phoneme clustering
Kay M. Berkling, Etienne Barnard

Separation of speakers in audio data
Jesper O. Olsen

Text-dependent speaker verification using dynamic time warping and vector quantization of LSF
J.-L. Bonifas, I. Hernaez Rioja, B. Etxebarria Gonzalez, S. Saoudi

On MMI learning of Gaussian mixture for speaker models
Haizhou Li, Jean-Paul Haton, Yifan Gong

Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification
Yifan Gong

Comparison of different HMM based methods for speaker verification
Daniele Falavigna

Speaker classification by neural network for short utterances using phoneme groups in Farsi
J. Sheikhzadegan, M. Tebiani, M. Lotfizad, M. R. Roohani

Speaker recognition experiments on the NTIMIT database
J.-L. Le Floch, C. Montacie, M.-J. Caraty

Speaker identification using vector quantisation with codeword-specific derivative coding
Michael Wagner, John S. Mason, J. Bruce Millar

Speaker recognition with temporal transition models
Haizhou Li, Jean-Paul Haton, Jian Su, Yifan Gong

Speaker recognition using HMM composition in noisy environments
Tomoko Matsui, Tomohito Kanno, Sadaoki Furui

Speaker recognition using HMM with experiments on the YOHO database
ChiWei Che, Qiguang Lin

Speaker recognition models
Kin Yu, John S. Mason, John Oglesby

Multi-state predictive neural networks for text-independent speaker recognition
T. Artieres, Patrick Gallinari

Spoken Language Resources I

A database for microphone array experimentation
Ea-Ee Jan, Piergiorgio Svaizer, James L. Flanagan

The OGI 22 language telephone speech corpus
T. Lander, Ronald A. Cole, B. T. Oshika, M. Noel

New telephone speech corpora at CSLU
Ronald A. Cole, M. Noel, T. Lander, T. Durham

The Dutch polyphone corpus
E. A. den Os, T. I. Boogaart, Lou Boves, Esther Klabbers

The Waxholm application database
J. Bertenstam, Mats Blomberg, Rolf Carlson, Kjell Elenius, Björn Granström, Joakim Gustafson, Sheri Hunnicutt, J. Hogberg, R. Lindell, L. Neovius, Lennart Nord, Antonio de Serpa-Leitao, N. Strom

A pitch extraction reference database
F. Plante, Georg F. Meyer, William A. Ainsworth

EAGLES spoken language working group: overview and results
Richard Winski, Roger K. Moore, Dafydd Gibbon

CEUDEX: a data base oriented to context-dependent units training in Spanish for continuous speech recognition
Celinda de la Torre-Munilla, Luis Hernandez-Gomez, Daniel Tapias

Design of a phonetic corpus for a speech database in Basque language
K. Lopez de Ipina, I. Torres, L. Onederra

You'd better say nothing than say something wrong: analogy, accuracy and text-to-speech applications
V. Pirrelli, S. Federici

Bulgarian speech database: a pilot study
A. Misheva, S. Dimitrova, V. Filipov, E. Grigoreva, M. Nikov, Peter Roach, S. Arnfield

The Phondat-Verbmobil speech corpus
Wolfgang J. Hess, Klaus J. Kohler, Hans-Günther Tillmann

EUROM - a spoken language resource for the EU - the SAM projects
Dominic Chan, Adrian Fourcin, Dafydd Gibbon, Björn Granström, Mark Huckvale, George Kokkinakis, Knut Kvale, Lori Lamel, Borge Lindberg, Asunción Moreno, Jiannis Mouropoulos, Franco Senia, Isabel Trancoso, Corin 't Veld, Jerome Zeiliger

A flexible formal language for the orthographic transcription of spontaneous spoken dialogues
Gernot A. Fink, Michaela Johanntokrax, Brigitte Schaffranietz

Design and implementation of Mandarin speech database in Taiwan
Hsiao-Chuan Wang

Search Methods I-II

Word hypothesizer based on reliably detected phoneme similarity regions
Philippe Morin, Ted H. Applebaum

Experimental analysis of the search space for 20 000-word speech recognition
S. Ortmanns, Hermann Ney

Speech parsing by downward request search based on the divide and conquer method
Ming-Sheng Wang, Satoshi Imai

Fast match based on decision tree
Claire Waast, Lalit Bahl, Marc El-Beze

Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition
Yoshiaki Noda, Shigeki Sagayama

Utterance verification improves closed-set recognition and out-of-vocabulary rejection
Don Colton, Mark Fanty, Ronald A. Cole

A comparison of two exact algorithms for finding the n-best sentence hypotheses in continuous speech recognition
V. M. Jimenez, A. Marzal, J. Monné

Top-down speech detection and n-best meaning search in a voice activated telephone extension system
Kazuya Takeda, Shingo Kuroiwa, Masaki Naito, Seiichi Yamamoto

Fast likelihood computation for continuous-mixture densities using a tree-based nearest neighbor search
Frank Seide

Hamming distance approximation for a fast log-likelihood computation for mixture densities
Peter Beyerlein, Meinhard Ullrich

An efficient output probability computation for continuous HMM using rough and detail models
Yasuhiro Komori, Masayuki Yamada, Hiroki Yamamoto, Yasunori Ohora

Speeding up the score computation of HMM speech recognizers with the bucket Voronoi intersection algorithm
J. Fritsch, I. Rogina, Tilo Sloboda, Alex Waibel

Analysis for Speech Recognition I-III

On the speech feature selection problem: are dynamic features more important than the static ones?
Jan Nouza

Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition
Climent Nadeu, Pau Paches-Leal, Biing-Hwang Juang

Robust phoneme prototype extraction for speech recognition
Dimitris Tambakas, Nikos Fakotakis, George Kokkinakis

The even transform: a variance-equalizing orthogonal transformation and its application to speech recognition
Melvyn J. Hunt

Using segmental coefficients in HMM speech recognition
Kai Hübener

On the use of the derivative of the pole trajectories of the LPC analysis parameter sequence as an alternative to delta parameters
F. Freitag, E. Monte, Javier Hernando

On the dual role of sequence directionality and coherence in a spectral predictive discrimination model
P. V. S. Rao, R. Raveendran

On the decorrelation of filter-bank energies in speech recognition
Climent Nadeu, Javier Hernando, Monica Gorricho

Time derivatives, cepstral normalization, and spectral parameter filtering for continuously spelled names over the telephone
Jean-Claude Junqua, Dominique Fohr, J.-F. Mari, Ted H. Applebaum, Brian A. Hanson

Some new considerations about the spectral form of French stop bursts
Linda Djezzar

Characterization of spectral transition region by various prediction approaches for discriminating stop consonants
P. V. S. Rao, R. Raveendran

Fast automatic segmentation and labeling: results on TIMIT and EUROM0
A. Vorstermans, Jean-Pierre Martens, Bert Van Coile

Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model
Gordon Ramsay, Li Deng

A comparison of several speech parameters for speaker independent speech recognition and speaker recognition
J. Sirigos, Nikos Fakotakis, George Kokkinakis

Speech parameterization based on phonetic features: application to speech recognition
Nabil N. Bitar, Carol Y. Espy-Wilson

Experiments with linear feature extraction in speech recognition
K. Beulen, L. Welling, Hermann Ney

Nonlinear feature transformation based on statistical phoneme modeling
Christian-M. Westendorf

Skewness and nonstationarity measures applied to reliable speech endpoint detection
Juan L. Navarro-Mesa, Asunción Moreno

The distance set representation of speech segments
Ramesh R. Sarukkai, Dana H. Ballard

Multi-variate mixture probability density modelling of VQ codebook using gradient descent algorithm
S. Dobrisek, R. Mihelic, N. Pavesic

Prosody I-III

Analysis and synthesis of prosodic features in spoken dialogue of Japanese
Mayumi Sakata, Keikichi Hirose

Prosodic influence on segmental quality
Nick Campbell

Towards voice-interactive telephone services in Slovenia: on prosody of digits using the sociolinguistic framework
Bojan Petek

Test environment for the two level model of Germanic prominence
Gregor Möhler, Grzegorz Dogil

Linguistic and acoustic characteristics of pause intervals in spontaneous speech
Nancy A. Daly-Kelly

Modeling the contextual effects on prosody in dialog
Y. Yamashita, R. Mizoguchi

Prosodic scoring of word hypotheses graphs
Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth, Ernst Günter Schukat-Talamazzini, A. Zottmann, Anton Batliner

Robust pitch period detection using dynamic programming with an ANN cost function
S. Harbeck, Andreas Kießling, Ralf Kompe, Heinrich Niemann, Elmar Nöth

Automatic detection of major phrase boundaries using statistical properties of superpositional F0 control model parameters
Toshio Hirai, Norio Higuchi, Yoshinori Sagisaka

Using neural networks to locate pitch accents
Paul Taylor

The relation between physiological signals and F0: a quantitative analysis method
Helmer Strik

Analysis of prosodic characteristics in speech advisories and their application to speech output
Masanobu Abe

Pitch and elocution rate of divers' speech
M. Guitton, F. Javier Caminero-Gil, Joel Crestel, Laure Charonnat

Detection of accents, phrase boundaries and sentence modality in German with prosodic features
Volker Strom

Synthesis and evaluation of intonation with a superposition model
Yann Morlec, Gérard Bailly, Véronique Aubergé

Pitch accent classification of fundamental frequency contours by hidden Markov models
Marcus Fach, Wolfgang Wokurek

Measuring the perceptual similarity of pitch contours
Dik J. Hermes

Microprosodic study of isolated French word corpora
Philippe Langlais

Benchmarking and Assessment I-II

Collecting and analyzing spoken utterances for a speech controlled application
Johannes Müller, Holger Stahl

Speech intelligibility and loudness assessment in a wireless personal communication
Hiromi Nagabuchi, Akira Takahashi, Mineyoshi Ogawa

An experimental investigation of the input and error correction strategies used by subjects entering digits with the AURIX speech recogniser
K. S. Hone, R. W. Series, C. Baber

Goal-directed generation of intelligibility test vocabularies in the framework of names synthesis
Karim Belhoula

Human factors of a voice-controlled car stereo
Reinhold Haeb-Umbach, Stephan Gamm

Exploring the limits of system-directed dialogue, dialogue evaluation of the Danish dialogue system
Niels Ole Bernsen, Hans Dybkjaer, Laila Dybkjaer

Human benchmarks for speaker independent large vocabulary recognition performance
David A. van Leeuwen, Leo-Geert van den Berg, Herman J. M. Steeneken

Predictive assessment for speaker independent isolated word recognisers
Alison Simons

Consistency of inter-transcribers' transcription
Kobayashi Satoshi, Kitazawa Shigeyoshi

Comparison of reference system approaches for the quality assessment of synthesized speech
H. Klaus, A. Niebank

Multi-lingual assessment of speaker independent large vocabulary speech-recognition systems: the SQALE project
Herman J. M. Steeneken, David A. van Leeuwen

Error analysis on field data and improved garbage HMM modelling
K. Bartkova, D. Dubois, D. Jouvet, J. Monné

Interference of speech recognition feedback during diagnostic tasks
E. J. A. Verheijen, F. L. van Nes, L. M. de Bruyn, A. Hasman, J. W. Arends

Neural Networks I-IV

Improving speech recognition using speaker classification
David O. Baldwin, Georg F. Meyer

Modular neural networks with task-specific input parameters for speaker-independent speech recognition
Axel Glaeser

Large vocabulary speaker-independent continuous speech recognition with a new hybrid system based on MMI-neural networks
Gerhard Rigoll, Ch. Neukirchen, J. Rottland

REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition
Hervé Bourlard, Yochai Konig, Nelson Morgan

An RNN based speech recognition system with discriminative training
Tan Lee, P. C. Ching, L. W. Chan

Preliminary experiments for automatic speech understanding through simple recurrent networks
M. A. Castano, Enrique Vidal, F. Casacuberta

A neural network using non-uniform units for continuous speech recognition
Ha-Jin Yu, Yung-Hwan Oh

Temporal correlation modeling in a hybrid neural network/hidden Markov model speech recognizer
Horacio Franco, Vassilios Digalakis

Continuous speech segmentation with the gamma memory model
Laurent Buniet, Dominique Fohr

Incorporating fuzzy modelling in a hybrid HMM-ANNs system for CSR tasks
Xavier Menendez-Pidal, Ricardo de Cordoba, Javier Ferreiros, José M. Pardo

Neural networks for nonlinear discriminant analysis in continuous speech recognition
Wolfgang Reichl, S. Harengel, F. Wolfertstetter, Günther Ruske

Speech recognition experiments with a new multilayer LVQ network (MLVQ)
Gerhard Rigoll

Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
Joao Neto, Luis Almeida, Mike Hochberg, Ciro Martins, Luis Nunes, Steve Renals, Tony Robinson

Distributed binary representations for word recognition by TDNN-DTW hybrid systems
Premysl Puzrla, Frédéric Bimbot, Christoph Windheuser

A robust discrimination method based on selectively trained neural networks for confusable words in noisy conditions
Yolande Anglade

Connectionist speaker normalization and adaptation
Victor Abrash, Horacio Franco, Ananth Sankar, Michael Cohen

A competitive algorithm for training HMM for speech recognition
Pedro L. Galindo

Predictive connectionist speech recognition with a new discriminant learning algorithm
Martin Paping, Hans Marti, Mark Renfer

Preliminary results on speech signal segmentation with recurrent neural networks
Antonio J. Rubio, Ronan G. Reilly

Text independent neural network/rule based hybrid, continuous speech recognition
Klara Vicsi, Attila Vig

Automatic recognition of Cantonese lexical tones in connected speech by multi-layer perceptron
Ying Pang Ng, P. C. Ching, L. W. Chan

Combining HMM processing and formant measurements in automatic speech recognition
Dave Abberley, Phil Green

Recurrent neural prediction models for speech recognition
Kyungmin Na, Jekwan Ryu, Dong-Il Chang, Soo-Ik Chae, Souguil Ann

Exploiting acoustic-phonetic knowledge and neural networks for stop recognition
Linda Djezzar, Jean-Paul Haton

Estimation of speech formant-dynamics using neural networks
P. Gomez, V. Rodellar, A. Alvarez, J. Bobadilla, J. Bernal, V. Nieto, M. Perez

Pathology and Communications I-II

Methodological aspects in a multimedia database of vocal fold pathologies
Maurilio Nunes Vieira, Fergus R. McInnes, Mervyn A. Jack, Arnold Maran, Colin Watson, Moira Little

The application of Volterra LMS adaptive filtering to speech enhancement for the hearing impaired
V. Udayashankara, A. P. Shivaprasad

CAPDA: managing intelligibility in children and young adults with Down's syndrome or speech disorders
P. Rosso, J. H. Wright, M. Smith

Evaluation of a system for segmental speech quality assessment: voiceless fricatives
Alan A. Wrench, Mary S. Jackson, David S. Soutar, A. Gerry Robertson, Janet MacKenzie Beck

A diagnostic and rehabilitation aid workstation for speech and voice pathologies
B. Teston, B. Galindo

Improvement, evaluation and testing of a low cost multilingual portable speaking aid for the speech impaired
Geza Nemeth, Gabor Olaszy, Laszlo Pataki, Luis Hernandez Gomez, Diamantino Freitas

Empirical study to test the independence of different acoustic voice parameters on a large voice database
Dirk Michaelis, Hans Werner Strube

The spectral analysis of infant cry: an initial approximation
Sergio D. Cano Ortiz, Daniel Escobedo Beceiro, Manuel Socarras Reyes

Portable speech rate conversion system
N. Seiyama, A. Nakamura, A. Imai, T. Takagi, E. Miyasaka

A field test of sivo aid in China
Jialu Zhang

Analysis for palatalized articulation of [s] sounds using synthetic speech
Takayuki Arai, Keiko Okazaki, Setsuko Imatomi, Yuichi Yoshida

An attempt to classify LX signals
Krzysztof Marasek

Time series analysis of glottal cycle lengths of healthy and dysphonic speakers
Jean Schoentgen, Raoul de Guchteneere



