ISCA Archive Eurospeech 1993 Sessions
  ISCA Archive Sessions
top

3rd European Conference on Speech Communication and Technology

Berlin, Germany
22-25 September 1993

Speech Coding


M-LCELP speech coding at bit-rates below 4kbps
Kazunori Ozawa, Masahiro Serizawa, Toshiki Miyano, Toshiyuki Nomura

Fast vector quantization using neural maps for CELP at 2400bps
Eduardo Lopez-Gonzalo, Luis A. Hernandez-Gomez

Improving the speech quality of CELP-coders by optimizing the long-term delay determination
U. Balss, U. Kipper, Herbert Reininger, Dietrich Wolf

A stochastic speech coder with multi-band long-term prediction
Carmen Garcia-Mateo, J. L. Alba-Castro, Luis A. Hernandez-Gomez

Intelligibility evaluation of 4-5 kbps CELP and MBE vocoders: the hermes program experiment
B. W. M. Wery, Herman J. M. Steeneken

Algorithms for the CELP coder with ternary excitation
P. Dymarski, N. Moreau

Complexity reduction for federal standard 1016 CELP coder
M. Mauc, G. Baudoin, M. Jelinek

Objective analysis of the GSM half rate speech codec candidates
F. Wuppermann, Christiane Antweiler, M. Kappelan

A 5600 BPS VSELP speech coder candidate for half-rate GSM
Ira A. Gerson, Mark A. Jasiuk

A speech coder for TV programme description
A. M. Kondoz, B. G. Evans, M. R. Suddle

Pitch synchronous innovation CELP (PSI-CELP)
Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro, Takehiro Moriya

Vocoder design based on HOS
Asunción Moreno, José A. R. Fonollosa, Josep Vidal

Emulation of a formant vocoder at 600 and 800 bps
Nigel Sedgwick

A pitch synchronized synthesizer for the IMBE vocoder
W. Ma, A. M. Kondoz, B. G. Evans

An analysis of the performances of the MBE model when used in the context of a text-to-speech system
Thierry Dutoit, Henri Leich

High-quality synthesis of LPC speech using multiband excitation model
C. F. Chan

High-quality speech coding at 2.4 kbps based on time-frequency interpolation
Yair Shoham

Coding of speech signal by fractal techniques
Luca Marcato, Enzo Mumolo

A new reference signal for evaluating the quality of speech coded at low bit rates
Naomi Asanuma, Hiromi Nagabuchi

A psychophysical study of fourier phase and amplitude coding of speech
Changxue Ma, Douglas O'Shaughnessy







Data Bases, Speech Assessment, Noisy Speech


Albayzin speech database: design of the phonetic corpus
Asunción Moreno, Dolors Poch, Antonio Bonafonte, Eduardo Lleida, Joaquim Llisterri, Jose B. Marino, Climent Nadeu

A software tool for speech collection, recognition and reproduction
Carlos Ribeiro, Isabel Trancoso, Antonio Serralheiro

An object-oriented database for speech processing
Matti Karjalainen, Toomas Altosaar

Automatic annotation using multi-sensor data
Dominic S. F. Chan, Adrian J. Fourcin

Prolog tools for accessing the phondat database of spoken German
Christoph Draxler, Hans G. Tillmann, Barbara Eisen

Cluster-similarity: a useful database for speech processing
Ute Jekosch

SIRVA - a large speech database collected on the Italian telephone network
G. Castagneri, G. Di Fabbrizio, A. Massone, M. Oreglia

Objective assessment of speech communication systems; introduction of a software based procedure
Herman J. M. Steeneken, J. A. Verhave, Tammo Houtgast

Enhanced direct assessment of speech input systems within the SAM-a esprit project
Sven W. Danielsen

Evaluation of prosody in the French version of multilingual text-to-speech synthesis: neutralising segmental information in preliminary tests
Pascale Nicolas, Pascal Romeas

A clinical voice evaluation system
Sokol Saliu, Hideki Kasuya, Yasuo Endo, Yoshinobu Kikuchi

A speech therapy workstation for the assessment of segmental quality: voiceless fricatives
Alan A. Wrench, M. S. Jackson, Mervyn A. Jack, D. S. Soutar, A. G. Robertson, J. MacKenzie, John Laver

A speech enhancement system using higher order ar estimation in real environments
Josep M. Salavedra, Enrique Masgrau, Asunción Moreno, Xavier Jove

Proposal of a composite measure for the evaluation of noise cancelling methods in speech processing
R. Le Bouquin, G. Faucon, A. Akbariazirani

The use of linear prediction and spectral scaling for improving speech enhancement
P. M. Crozier, B. M. G. Cheetham, C. Holt, E. Munday

Robust speaker-independent speech recognition using non-linear spectral subtraction based IMELDA
Helge B. D. Sorensen, Uwe Hartmann






Speech Analysis, Articulatory Modelling


Pitch synchronous calculation of acoustic cues using a cochlea model
Marcel de Leeuw, Jean Caelen

Nonlinear dynamical systems concepts in speech analysis
Stephen McLaughlin, Andrew Lowry

Grouping of acoustical events using cable neurons and the theory of neuronal group selection
Arno J. Klaassen

Computationally efficient methods of calculating instantaneous frequency for auditory analysis
I. R. Gransden, S. W. Beet

Analysing connected speech with wavelets: some Italian data
Francesco Cutugno, Pietro Maturi

Speech transients analysis using AR-smoothed wigner-ville distribution
Krzysztof Marasek

Comparison of the variability of formants and formant targets using dynamic modeling
Michel Pitermann, Jean Caelen

Pitch-synchronous formant extraction by means of a compound auto-regressive model
Jean Schoentgen, Zoubir Azami

A new air flowmeter design for the investigation of speech production
Bernard Teston

Articulatory dynamics of lips in Italian /'vpv/ and /'vbv/ sequences
Emanuela Magno Caldognetto, Kyriaki Vagges, Giancarlo Ferrigno, Claudio Zmarich

Restricted distribution of pharyngeal segments: acoustical or mechanical constraints?
Ahmed M. Elgendy

Vowel normalization by articulatory normalization first attemps for vowel transitions
Yohan Payan, Pascal Perrier

Synthesis and analysis of vocal source with vibration of larynx
Nobuhiro Miki, Naohisa Kamiyama, Nobuo Nagai

Towards an acoustic-phonetic classification of modern standard arabic vowels
Imad Znagui, Sami Boudelaa

Divers' speech: variable encoding strategies
Alain Marchal, Christine Meunier

Phonetic reduction processes in spontaneous speech
L. Aguilar, B. Blecua, M. Machuca, R. Mann

Spectral characteristics of fricative sound
N. R. Ganguli

Automatic speaker recognition and analytic process
Jean-Francois Bonastre, Henri Meloni

Second formant locus-nucleus patterns in French and Swedish
Danielle Duez

Temporal organisation of segments and sub-segments in consonant clusters.
Christine Meunier

Automatic recognition of arabic stop consonants
Abdelkader Betari, Remy Bulot

Acoustic-phonetic decoding of Spanish occlusive consonants
I. Torres, P. Iparraguirre

Normalized vowel system representation for comparative phonetic studies
Philip Christov

Influence of prevocalic consonant on vowel duration in French CV[p] utterances
Cécile Thilly

Temporal variation in consonant clusters in Swedish
Peter Czigler

Discriminant analysis of continuous consonantal spectra
Wiktor Jassem






Segmentation and Labelling


Robust endpoint detection of speech in the presence of noise
Maria Rangoussi, Stylianos Bakamidis, George Carayannis

Automatic segmentation and labeling of English and Italian speech databases
B. Angelini, F. Brugnara, D. Falavigna, D. Giuliani, R. Gretter, M. Omologo

A segmental approach versus a centisecond one for automatic phonetic time-alignment
Azarshid Farhat, Guy Perennou, Regine Andre-Obrecht

A segmentation algorithm based on acoustical features using a self organizing neural network
I. Heroaez, J. Barandiaran, E. Monte, B. Etxebarria

SLAM: segmentation and labelling automatic module
Piero Cosi

Phone and syllable segmentation by concurrent window modules
Christian Heise, Hans-H. Bothe

Reliability of speech segmentation and labelling at different levels of transcription
Barbara Eisen

On the perception of acoustic and lexical vowel reduction
Dick R. van Bergem

Click detection in Italian and English
Brit van Ooyen, Anne Cutler, Pier Marco Bertinetto

Phonological variation and mismatch in lexical access
Andrew Nix, Gareth Gaskell, William Marslen-Wilson

Perception of word boundaries by dutch listeners
Monique van Zon, Beatrice de Gelder

Perception of French stop bursts, implications for stop identification
Anne Bonneau, Linda Djezzar, Yves Laprie

Using isofrequency neural column for harmonic sound scene decomposition
Zdravko Kacic, Bogomir Horvat

Do ear perceive vowel through formants?
A. K. Datta

Speech recognition using auditory models and neural networks
Trupti Vyas, Michael J. Pont, Seyed J. Mashari

The influence of temporal processes on spectral masking patterns of harmonic complex tones and vowels
Changxue Ma, Armin Kohlrausch

Temporal effect on the perception of continuous speech and a possible mechanism in the human auditory system
Hisao Kuwabara

Comparison of various adaptation mechanisms in an auditory model for the purpose of speech processing
Edward Jones, Eliathamby Ambikairajah

Sensory-motor manifestations of speech-hearing interaction
I. A. Vartanian, T. V. Chernigovskaya

Syllable perception: lateralization of native and foreign languages
T. V. Chernigovskaya, I. A. Vartanian, T. I. Tokareva

Simulation of short-latency auditory evoked potentials: a pilot study
Michael J. Pont

Intermediate representations in spoken word recognition: a cross-linguistic study of word illusions
Regine Kolinsky, Jose Morais

Time - varing manner on formant trajectories of Chinese diphthongs
Jianfen Cao

Iterative transformation and alignment for speech labeling
Yifan Gong, Jean-Paul Haton

Controlling search in segmentation lattices of speech signals
Kai Hübener, Andreas Hauenstein

Accent phrase segmentation using transition probabilities between pitch pattern templates
Hiroshi Shimodaira, Mitsuru Nakai

Syllable segmentation of continuous speech with artificial neural networks
W. Reichl, Günther Ruske

Labelling of speech given its text representation
Mats Blomberg, Rolf Carlson





Speech Synthesis


Joint arabic-hebrew speech synthesis system
M. Ouadou, A. Rajouani, M. Zyoute, J. Rosenfeld, M. Najim

Improvements of the Spanish version of the multivox text-to-speech system
Eduardo Lopez-Gonzalo, Gabor Olaszy, Geza Nemeth

Generating intonation for Swedish text-to-speech conversion using a quantitative model for the F0 contour
Mats Ljungqvist, Hiroya Fujisaki

PHRITTS - a text-to-speech synthesizer for the German language
P. Meyer, Hans-Wilhelm Rühl, R. Krüger, M. Kugler, L. L. M. Vogten, A. Dirksen, Karim Belhoula

Rule-based grapheme-to-phoneme conversion of names
Karim Belhoula

A prototype text-to-speech system for scottish gaelic
Iain R. Murray, Morag M. Black

A text-to-speech system for polish
Janusz Imiolczyk, Ignacy Nowak, Grazyna Demenko

Intelligibility as a function of speech coding method for template-based speech synthesis
Marian Macchi, Mary Jo Altom, Dan Kahn, Sharad Singhal, Murray F. Spiegel

Pronunciation and text normalisation in applied text-to-speech systems
Maggie Gaved

Evaluating synthesised prosody in simulations of an automated telephone enquiry service
Jill House, Catriona MacDermid, Scott McGlashan, Andrew Simpson, Nick Youd

Speech synthesis in dialogue systems
Katherine Morton, Marcel Tatham

Applying analysis of human emotional speech to enhance synthetic speech
Elissaveta Abadjieva, Iain R. Murray, John L. Arnott

A generic front end for text-to-speech synthesis systems
Eric Lewis, Marcel Tatham

Experiments with silent-e and affix correspondences in stochastic phonographic transduction
Robert W. P. Luk, Robert I. Damper

Phoneme-dependent speech synthesis in the time and frequency domains
Georg Fries

Speech synthesis experiments with the glove synthesiser
Inger Karlsson, Lennart Neovius

Auditory detection of discontinuities in synthesis-by-concatenation
Volker Kraft

Effects of the phase jitters on naturalness of synthesized speech
Yun-Keun Lee, Seung-Kwon Ahn

Letter-to-sound rules for the welsh language
Briony Williams








Speech Processing and Coding


Generalized frequency domain adaptive filter for acoustic echo canceller
F. Dohnal

Estimation of speech signal classification features in a simulated hyperbaric environment
J. Crestel, M. Guitton

Noise suppression system for a car
Petr Pollak, Pavel Sovka, Jan Uhlir

Adaptive gain control and echo cancellation for hands-free telephone systems
Peter Heitkamper, Michael Walker

Predicting segmental durations for accommodation within a syllable-level timing framework
W. Nick Campbell

A filtersank based on physiologically measured characteristics in an auditory model for speech signal processing
Tore Fjallbrant, Fisseha Mekuria, Shahrokh Amirijoo

Spectral sensitivity weighted transform coding for LSP parameters
Fu-Rong Jean, Chih-Chung Kuo, Hsiao-Chuan Wang

An efficient algorithm to estimate the instantaneous SNR of speech signals
Rainer Martin

Speech/non-speech detection for voice response systems
L. Mauuary, J. Monne

Time-spectral approach to compiling speech reconstruction
Alexander Osipov, Vladimir Zentsov

A voice activity detector based on cepstral analysis
J. A. Haigh, J. S. Mason

High quality coding of wideband speech at 24 kbit/s
Jürgen Paulus, Christiane Antweiler, Christian G. Gerlach

A 32 kbit/s wideband speech coder based on transform coding
H. Dia, Gang Feng, Y. Mahieux

Realtime implementation of high-quality 32 kbps wideband LD-CELP coder
Oded Gottesman, Yair Shoham

A fixed-point implementation of the 16 kb/s LD-CELP speech coding algorithm
A. Popescu, D. Vicard, F. Druilhe

Optimality of sequential quantization in analysis-by-synthesis speech codecs
Christian G. Gerlach

A sub-band MPLPC coder for high quality speech coding at 16 kbit/s
Radwan Kastantin, Gang Feng

Optimal multepulse excitation determination by simulated annealing
Enzo Mumolo, Alessio Rebelli

Split vector quantization of the LPC parameters using weighted lattice structure
K. W. Law, C. F. Chan

A new approach to noiseless interframe coding of LPC parameters in vector quantizer applications
Stefan Bruhn

Efficient quantization of speech spectral information
Torbjörn Svendsen

Enhancing robustness of coded LPC-spectra to channel errors by use of residual redundancy
Stefan Feldes

Multi-rate source and channel coding for mobile communication systems
S. A. Atungsiri, A. M. Kondoz, B. G. Evans

Training method of the excitation codebook for CELP
Takehiro Moriya, Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro




Speech Translation, Language Identification, Parsers


ATREUS: a speech recognition front-end for a speech translation system
Shigeki Sagayama, Jun-ichi Takami, Akito Nagai, Harald Singer, Kouichi Yamaguchi, Kzumi Ohkura, Kenji Kita, Akira Kurematsu

ATR's speech translation system: ASURA
Tsuyoshi Morimoto, Toshiyuki Takezawa, Fumihiro Yato, Shigeki Sagayama, Toshihisa Tashiro, Masaaki Nagata, Akira Kurematsu

Recent advances in JANUS: a speech translation system
Monika Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, Ivica Rogina, C. P. Rose, Tilo Sloboda, M. Tomita, J. Tsutsumi, N. Aoki-Waibel, Alex Waibel, Wayne Ward

Spoken language translation with MID-90's technology: a case study
Manny Rayner, Ivan Bretan, David Carter, Michael Collins, Vassilios Digalakis, Bjorn Gamback, Jaan Kaja, Jussi Karlgren, Bertil Lyberg, Stephen Pulman, Patti Price, Christer Samuelsson

Automatic language identification using a segment-based approach
Timothy J. Hazen, Victor W. Zue

A comparison of approaches to automatic language identification using telephone speech
Yeshwant Muthusamy, Kay Berkling, Takayuki Arai, Ronald Cole, Etienne Barnard

Integration of neural networks and robust parsers in natural language understanding
Ying Cheng, Yves Normandin, Paul Fortier

Joint speech and gesture analysis some experimental results on multimodal interface
Pierre Dauchy, Christophe Mignot, Claude Valot

Generation of speech reply in the speech response system
Keikichi Hirose, Yasuharu Asano

A fast multilingual probabilistic tagger
Evangelos Dermatas, George Kokkinakis

The possibility for acquisition of statistical network grammar using ergodic HMM
Jin'ichi Murakami, Hiroki Yamatomo, Shigeki Sagayama

A robust analyzer for spoken language understanding
Evelyne Millien, Roland Kuhn

Identifying usability attributes of automated telephone services
R. T. Dutton, John C. Foster, Mervyn A. Jack, F. W. Stentiford

Utilising prosody to perform syntactic disambiguation
Andrew Hunt

Spell: an automated system for computer-aided pronunciation teaching
Steven Hiller, Edmund Rooney, Jean-Paul Leffevre, Mervyn A. Jack

Training vowel pronunciation using a computer-aided teaching system
Edmund Rooney, Rebecca Vaughan, Steven Hiller, Fabrizio Carraro, John Laver

Methods for traversing a pre-recorded speech message network to optimise dialogue in telephone answering systems
Mary Zajicek, Ken Brownsey

Service creation tools for creating speech interactive services
Roger Hanes, Jo Salter, Paul Popay, Frances Hedley

Deaccentuation and persistence of grammatical function and surface position
Julia Hirschberg, Jacques Terken

Design and implementation of a speech server for unix based multimedia applications
Stefan Euler, K. Riedel

Romaine: a lattice based approach to lexical access
David Goodine, Victor W. Zue

A system for clustering spoken documents
Toffee A. Albina, Erica G. Bernstein, David M. Goblirsch, Douglas E. Lake







Speech Recognition, HMMs, NNs


Multiple codebook Spanish phone recognition using semicontinuous hidden Markov models
I. Torres, Francisco Casacuberta

An efficient algorithm to find the best state sequence in HSMM
Antonio Bonafonte, Xavier Ros, Jose B. Marifio

Robust HMM-based endpoint detector
Alex Acero, C. Crespo, C. de la Torre, J. C. Torrecilla

Experiments on Spanish phone recognition using automatically derived phonemic baseforms
Isabel Galiano, Francisco Casacuberta

Evaluation of VQ-distortion based HMM
Seiichi Nakagawa, Hideyuki Suzuki, Li Zhao

Continuous HMM for word spotting and rejection of non vocabulary word in speech recognition over telephone networks
Jianming Song

Bayesian learning of the parameters of discrete and tied mixture HMMs for speech recognition
Qiang Huo, Chorkin Chaw, Chin-Hui Lee

Speech recognition using semantic hidden Markov networks
Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Ernst G. Schukat-Talamazzini

Experiments in vocabulary independent speech recognition using phoneme decision trees
Simon Downey, Martin Russell, Peter Nowell, David Bijl, Kirsta Galloway, Keith Ponting

Segmental hidden Markov models
M. J. F. Gales, Steve J. Young

Impact of dimensionality and correlation of observation vectors in HMM-based speech recognition
Xue Wang, Louis F. M. ten Bosch, Louis C. W. Pols

Evaluation of an HMM speech recognizer with various continuous speech databases
F. Class, A. Kaltenmeier, Peter Regel-Brietzmann

Hidden Markov models for noisy speech recognition
Adam Wrzoskowicz

Neural network speech enhancer utilizing masking properties
D. E. Tsoukalas, J. Mourjopoulos, George Kokkinakis

Comparison of geometric, connections and structural techniques on a difficult isolated word recognition task
Maria J. Castro, Juan C. Perez

Prediction and discrimination in neural networks for continuous speech recognition
A. Mellouk, P. Gallinari, F. Rauscher

Two schemes of phonetic feature extraction using artificial neural networks
Shuping Ran, J. Bruce Millar

On use of discriminant analysis in predictive connectionist speech recognition
Bojan Petek, Anuska Ferligoj

Non-linear time compression for lexical access
N. H. Russell, Frank Fallside, R. W. Prager

Talker enrollment for speech recognition by synthesis
Richard Brierton, Nigel Sedgwick

Improving robustness of network grammar by using class HMM
Kauzya Takeda, Naomi Inoue, Shingo Kuroiwa, Tomohiro Konuma, Seiichi Yamamoto

Parallelising k-means clustering on distributed memory MIMD computers
J. A. Elliott, M. E. Forsyth, F. R. McInnes, N. W. Ramsey

On the proper sub-word unit inventory for CSR
P. Berenyi, Klára Vicsi

Speech recognition using the atomic speech units constructed from overlapping articulatory features
Li Deng, Don Sun

A Bayesian approach to phone duration adaptation for lombard speech recognition
Olivier Siohan, Yifan Gong, Jean-Paul Haton

Multiple multilabeling to improve HMM-based speech recognition in noise
J. Hernando, Jose B. Marino, Climent Nadeu

Discrimination of polish stop consonants based on mapped techniques
Lutoslawa Richter, Piotr Domagaia







Telecommunication, Application Aspects


Speech recognition over packetized voice systems
Bo Baungaard, Jorn Stern Nielsen

Voice applications on BT's derived services network
I. W. G. Jenkins

A French oral dialogue system for flight reservations over the telephone
Jean-Yves Magadur, Frédéric Gavignet, Francois Andry, Francis Charpentier

A voice-activated extension telephone exchange system
Shingo Kuroiwa, Kazuya Takeda, Naomi Inoue, Izuru Nogaito, Seiichi Yamamoto, Makoto Shouzakai, Kunihiko Owa, Masahiko Takahashi, Ryuuji Matsumoto

The VOIS project in retrospect
William C. G. Ortel, Dina Yashchin

TELEMACO - a real time keyword spotting application for voice dialling
Eduardo Lleida, Jose B. Marino, Arturo Moreno

The relative importance of the factors affecting recogniser performance with telephone speech
Peter Wyard

A robust acoustic echo canceller for a hands-free voice-controlled telecommunication terminal
Thomas Burger, Ulrich Schultheiß

Polyphase allpass IIR structures for sub-band acoustic echo cancellation
J. E. Hart, P. A. Naylor, O. Tanrikulu

Speech input systems and their effect on written language skills
James Monaghan, Christine Cheepen

Voxaid: an interactive speaking communication aid software for the speech impaired
Gabor Olaszy, Geza Nemeth

Feature extraction for profoundly deaf people
U. Hartmann, K. Hermansen, F. K. Fink

Architecture of a 10,000 word real time speech recognizer
Alfred Hauenstein

A noise-robust real-time word recognition hardware module
Thomas Hermann, Harald Eckhardt, Michael Trompf, Heidi Hackbarth

KARS: a speaker-independent, vocabulary-independent speech recognition system
Myoung-Wan Koo

A parallel processing keyword recogniser for police national computer enquiries
F. R. McInnes, J. A. Elliott, N. W. Ramsey, M. E. Forsyth, A. M. Sutherland, Mervyn A. Jack

Cost232: speech recognition over the telephone line
Andrea Paoloni, Torbjörn Svendsen, B. Kaspar, Denis Johnston, Gunnar Hult

Individual variability in the perception of synthetic speech
Valerie Hazan, Bo Shi

Speech recognition system and its application for blind PC users
Ye. K. Ludovic, V. V. Pilipenko, G. E. Tseitlin, L. I. Nagornaya, T. Terzian






Speech Analysis: Pitch and Prosody


Analysing prosody by means of a double tree structure
Berit Horvei, Georg Ottesen, Sveire Stensby

Prosody and discourse interpretation
Geneviève Caelen-Haumont

Duration modelling for the greek language
George Epitropakis, D. Tambakas, Nikos Fakotakis, George Kokkinakis

Prosody control of TTS-systems based on linguistic analysis
George Epitropakis, Nickolas Yiourgalis, George Kokkinakis

Prosody takes over: a prosodically guided dialog system
Ralf Kompe, Andreas Kießling, T. Kuhn, Marion Mast, Heinrich Niemann, Elmar Nöth, K. Ott, Anton Batliner

Integration of a prosodic component in an automatic speech recognition system
P. Langlais, Henri Meloni

Referent tracking in restricted texts using a lemmatized lexicon: implications for generation of intonation
Merle Horne, Marcus Filipsson, Mats Ljungqvist, Anders Lindström

Perceptual significance of focus accent in spoken Swedish
Robert Bannert

Pitch estimation of speech signal with the wavelet transform
Silvio Montresor, Marc Baudry

A spectral AMDF method for pitch extraction of noise-corrupted speech
Jae Yeol Rheem, Myung Jin Bae, Sou Guil Ann

A reliable postprocessor for pitch determination algorithms
Gao Yang, Henri Leich

Vowel pitch period extraction by models of neurones in the mammalian brain-stem
G. F. Meyer, William A. Ainsworth

Auto-regressive linear models of jitter
Jean Schoentgen, Raoul de Guchteneere

Larynx period detection methods in speech pattern hearing AIDS
Jianing Wei, David Howells, Andrew Faulkner, Adrian Fourcin

Fundamental frequency of dutch women: an evaluative study
Renee van Bezooijen







Complex Forms of Speech & Speaker Recognition


Detection and transcription of new words
B. Suhm, Monika Woszczyna, Alex Waibel

Efficient enumeration of sentence hypotheses in connected word recognition
Victor M. Jimenez, Andres Marzal, Enrique Vidal

Locating disfluencies in spontaneous speech: an acoustical analysis
Douglas O'Shaughnessy

Integration of phonological knowledge in a continuous speech recognition system
Roselyne Nguyen, Kamel Smaili, Jean-Paul Haton, Guy Perennou

Prosody and continuous speech recognition
Pierre Dumouchel, Douglas O'Shaughnessy

Spoken-language processing for restricted domains: a sublanguage approach
H. Bergmann, H.-H. Hamer, A. Noll, A. Paeseler, H. Tomaschewski

The use of state tying in continuous speech recognition
Steve J. Young, Phil C. Woodland

The HTK tied-state continuous speech recogniser
Phil C. Woodland, Steve J. Young

Combination of training criteria to improve continuous speech recognition
Laurence Devillers, Christian Dugast

Experiments with an articulatory speech recognizer
Igor Zlokarnik

Techniques for robust recognition in restricted domains
Giuliano Antoniol, Mauro Cettolo, Marcello Federico

Use of explicit context-dependent phonemic model in continuous speech recognition
Feriel Mouria, Yifan Gong, Jean-Paul Haton

Base transformation for environment adaptation in continuous speech recognition
Yifan Gong

Improved a-posteriori processing for keyword spotting
Baruch Mazor, Ming-Whei Feng

Single and multi-channel speech enhancement for a word spotting system
J. Ortega-Garcia, J. M. Paez-Borrallo, Luis A. Hernandez-Gomez

Estimating 'small' probabilities by leaving-one-out
Hermann Ney, Ute Essen

Semantic and pragmatically based re-recognition of spontaneous speech
Sheryl R. Young, Wayne Ward

Modeling of time constituents for speech understanding
Bernd Hildebrandt, Gernot A. Fink, Franz Kummert, Gerhard Sagerer

Phonetic segmentation method for the continuous czech speech recognition
Vaclav Matousek

Speech recognition applied to reading assistance for children: a baseline language model
Alexander G. Hauptmann, Lin L. Chase, Jack Mostow

Modelling speaker normalization by adapting the BIAS in a neural net
David J. M. Weenink, Louis C. W. Pols

Neural models for extracting speaker characteristics in speech modelization systems
T. Artieres, P. Gallinari

Influence of pattern compression on speaker verification
J. Zinke

A comparative study of speaker adaptation under realistic conditions
Florian Schiel

A comparison of speaker recognition techniques for telephone speech
D. A. Irvine, F. J. Owens

Speaker verification over telephone channels based on concatenated phonemic hidden Markov models
Johan de Veth, Guido Gallopyn, Hervé Bourlard

Speaker adaptation using a predictive model
Stephen Cox

Combining features via LDA in speaker recognition
Z. P. Sun, J. S. Mason

Neural networks for speech and speaker recognition through a digital telephone exchange
J. M. Elvira, R. A. Carrasco

Performance comparison of machine and human speaker verification
M. Mehdi Homayounpour, J. Philippe Goldman, Gérard Chollet, Jacqueline Vaissière

The effect of utterance length and content on speaker-verifier performance
M. I. Hannah, A. T. Sapeluk, Robert I. Damper, I. M. Roger

The use of pseudostationary segments for speaker identification
Antanas Lipeika, Joana Lipeikiene

Bayesian decision in the speaker recognition by acoustic parametrization of voice samples over telephone lines
A. Federico, Andrea Paoloni


×

Keynotes

Speech Coding

Articulatory Modelling

Voice Source Analysis and Modelling

HMM-Based Recognition System

Speech Signal Processing

Speaker Recognition

Data Bases, Speech Assessment, Noisy Speech

Phonetics

Phoneme Classification and Labelling

Duration Modelling in HMMs

Speaker Adaptation and Normalization

Speech Analysis, Articulatory Modelling

Prosody: Rhythm, Style, Emotion

Improved Algorithms for HMMs

Noisy Speech and Enhancement

Speaker Variability

Segmentation and Labelling

Prosody: Analysis and Modelling of F0 Contours

Speech Recognition in Noise

Speaker Independency

Speech Synthesis

Dialogue Structure

Language Modelling

Prosody: Prosodic Parameter Manipulation

New Architectures for Neural Networks

Noise Reduction and Channel Adaption

Word Spotting

Speech Processing and Coding

Prosody: Phrasing

MLPs and TDNNs for Speech Recognition

Speech Translation, Language Identification, Parsers

Dialogue Evalution

Data Bases

Letter to Sound and Architecture for TTS

Perception

Search Algorithms

Speech Recognition, HMMs, NNs

Spoken Language Dialogue

Speech Input/Output Assessment

Synthesis: Sound Generation

Hybrid HMMs/ANNs for Speech Recognition

Visual Cues

Telecommunication, Application Aspects

Spoken Language Dialogue Application

Synthesis: Articulatory and Source Modelling

Syntactical Constraints

Pathological Voice Analysis

Speech Analysis: Pitch and Prosody

Applications

Synthesis: Systems, Syntax, Prosody

Large Vocabulary Systems

Continuous Speech Recognition Systems

Human Factors

Complex Forms of Speech & Speaker Recognition