Table of Contents and Access to Abstracts
Plenary Talks
Lee, Chin-Hui:
"From decoding-driven to detection-based paradigms for automatic speech recognition",
paper P2.
Lee, Hyun-Bok:
"In search of a universal phonetic alphabet - theory and application of an organic visible speech-",
paper P3.
Vaissière, Jacqueline:
"From X-ray or MRI data to sounds through articulatory synthesis: towards an integrated view of the speech communication process",
paper P4.
Speech Recognition - Adaptation
Balakrishnan, Sreeram / Visweswariah, Karthik / Goel, Vaibhava:
"Stochastic gradient adaptation of front-end parameters",
1-4.
Raux, Antoine / Singh, Rita:
"Maximum-likelihood adaptation of semi-continuous HMMs by latent variable decomposition of state distributions",
5-8.
Huang, Chao / Chen, Tao / Chang, Eric:
"Transformation and combination of hidden Markov models for speaker selection training",
9-12.
Mak, Brian / Hsiao, Roger:
"Improving eigenspace-based MLLR adaptation by kernel PCA",
13-16.
Chatzichrisafis, Nikos / Digalakis, Vasilios / Diakoloukas, Vasilios / Harizakis, Costas:
"Rapid acoustic model development using Gaussian mixture clustering and language adaptation",
17-20.
Visweswariah, Karthik / Gopinath, Ramesh:
"Adaptation of front end parameters in a speech recognizer",
21-24.
Giuliani, Diego / Gerosa, Matteo / Brugnara, Fabio:
"Speaker normalization through constrained MLLR based transforms",
2893-2896.
Mu, Xiangyu / Zhang, Shuwu / Xu, Bo:
"Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying",
2897-2900.
Stemmer, Georg / Steidl, Stefan / Hacker, Christian / Nöth, Elmar:
"Adaptation in the pronunciation space for non-native speech recognition",
2901-2904.
Wang, Xuechuan / O'Shaughnessy, Douglas:
"Robust ASR model adaptation by feature-based statistical data mapping",
2905-2908.
Han, Zhaobing / Zhang, Shuwu / Xu, Bo:
"A novel target-driven generalized JMAP adaptation algorithm",
2909-2912.
Mak, Brian / Ho, Simon / Kwok, James T.:
"Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA",
2913-2916.
Jeon, Hyung Bae / Kim, Dong Kook:
"Maximum a posteriori eigenvoice speaker adaptation for Korean connected digit recognition",
2917-2920.
Wang, Wei / Zahorian, Stephen:
"Vocal tract normalization based on spectral warping",
2921-2924.
Tanaka, Koji / Ren, Fuji / Kuroiwa, Shingo / Tsuge, Satoru:
"Acoustic model adaptation for coded speech using synthetic speech",
2925-2928.
Suzuki, Motoyuki / Ogasawara, Hirokazu / Ito, Akinori / Ohkawa, Yuichi / Makino, Shozo:
"Speaker adaptation method for CALL system using bilingual speakers' utterances",
2929-2932.
Watanabe, Shinji:
"Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task",
2933-2936.
Tsai, Wei-Ho / Cheng, Shih-Sian / Wang, Hsin-Min:
"Speaker clustering of speech utterances using a voice characteristic reference space",
2937-2940.
Kim, Young Kuk / Song, Hwa Jeon / Kim, Hyung Soon:
"Performance improvement of connected digit recognition using unsupervised fast speaker adaptation",
2941-2944.
Kim, Hyung Soon / Song, Hwa Jeon:
"Simultaneous estimation of weights of eigenvoices and bias compensation vector for rapid speaker adaptation",
2945-2948.
Wölfel, Matthias:
"Speaker dependent model order selection of spectral envelopes",
2949-2952.
Bocchieri, Enrico / Riley, Michael / Saraclar, Murat:
"Methods for task adaptation of acoustic models with limited transcribed in-domain data",
2953-2956.
Fujii, Atsushi / Ishikawa, Tetsuya / Itou, Katsunobu / Akiba, Tomoyosi:
"Unsupervised topic adaptation for lecture speech retrieval",
2957-2960.
Liu, Haibin / Wu, Zhenyang:
"Mean and covariance adaptation based on minimum classification error linear regression for continuous density HMMs",
2961-2964.
Nagino, Goshu / Shozakai, Makoto:
"Design of ready-made acoustic model library by two-dimensional visualization of acoustic space",
2965-2968.
Spoken Language Identification, Translation and Retrieval I
Gauvain, Jean-Luc / Messaoudi, Abdel / Schwenk, Holger:
"Language recognition using phone lattices",
25-28.
Huckvale, Mark:
"ACCDIST: a metric for comparing speakers' accents",
29-32.
Levit, Michael / Gorin, Allen / Haffner, Patrick / Alshawi, Hiyan / Nöth, Elmar:
"Aspects of named entity processing",
33-36.
Crego, Josep M. / Marino, José B. / Gispert, Adria de:
"Finite-state-based and phrase-based statistical machine translation",
37-40.
Schultz, Tanja / Jou, Szu-Chen / Vogel, Stephan / Saleem, Shirin:
"Using word lattice information for a tighter coupling in speech translation systems",
41-44.
Misu, Teruhisa / Kawahara, Tatsuya / Komatani, Kazunori:
"Confirmation strategy for document retrieval systems with spoken dialog interface",
45-48.
Lee, Shi-Wook / Tanaka, Kazuyo / Itoh, Yoshiaki:
"Multilayer subword units for open-vocabulary spoken document retrieval",
1553-1556.
Itoh, Yoshiaki / Tanaka, Kazuyo / Lee, Shi-wook:
"An efficient partial matching algorithm toward speech retrieval by speech",
1557-1560.
Sedogbo, Celestin / Herry, Sebastien / Gas, Bruno / Zarader, Jean Luc:
"Language detection by neural discrimination",
1561-1564.
Córdoba, Ricardo de / Ferreiros, Javier / Sama, Valentin / Macias-Guarasa, Javier / D'Haro, Luis F. / Fernandez, Fernando:
"Language identification techniques based on full recognition in an air traffic control task",
1565-1568.
Hansen, John H. L. / Yapanel, Umit / Huang, Rongqing / Ikeno, Ayako:
"Dialect analysis and modeling for automatic classification",
1569-1572.
Ferragne, Emmanuel / Pellegrino, Francois:
"Rhythm in read British English: interdialect variability",
1573-1576.
Fung, Pascale / Liu, Yi / Yang, Yongsheng / Shen, Yihai / Wu, Dekai:
"A grammar-based Chinese to English speech translation system for portable devices",
1577-1580.
Tur, Gokhan:
"Cost-sensitive call classification",
1581-1584.
Kurimo, Mikko / Turunen, Ville / Ekman, Inger:
"An evaluation of a spoken document retrieval baseline system in Finnish",
1585-1588.
Jiang, Hui / Liu, Pengfei / Zitouni, Imed:
"Discriminative training of naive Bayes classifiers for natural language call routing",
1589-1592.
Moreau, Nicolas / Kim, Hyoung-Gook / Sikora, Thomas:
"Phonetic confusion based document expansion for spoken document retrieval",
1593-1596.
Chung, Euisok / Lim, Soojong / Hwang, Yi-Gyu / Jang, Myung-Gil:
"Hybrid named entity recognition for question-answering system",
1597-1600.
Ajmera, Jitendra / McCowan, Iain / Bourlard, Hervé:
"An online audio indexing system",
1601-1604.
Sanders, Eric / Wet, Febe de:
"Histogram normalisation and the recognition of names and ontology words in the MUMIS project",
1605-1608.
Amaral, Rui / Trancoso, Isabel:
"Improving the topic indexation and segmentation modules of a media watch system",
1609-1612.
Barkat-Defradas, Melissa / Hamdi, Rym / Ferragne, Emmanuel / Pellegrino, Francois:
"Speech timing and rhythmic structure in Arabic dialects: a comparison of two approaches",
1613-1616.
Wang, Hsin-min / Cheng, Shih-sian:
"METRIC-SEQDAC: a hybrid approach for audio segmentation",
1617-1620.
Kuo, Jen-Wei / Huang, Yao-Min / Chen, Berlin / Wang, Hsin-min:
"Statistical Chinese spoken document retrieval using latent topical information",
1621-1624.
Matsushita, Masahiko / Nishizaki, Hiromitsu / Nakagawa, Seiichi / Utsuro, Takehito:
"Keyword recognition and extraction by multiple-LVCSRs with 60,000 words in speech-driven WEB retrieval task",
1625-1628.
Zhang, Ruiqiang / Kikui, Genichiro / Yamamoto, Hirofumi / Soong, Frank K. / Watanabe, Taro / Sumita, Eiichiro / Lo, Wai-Kit:
"Improved spoken language translation using n-best speech recognition hypotheses",
1629-1632.
Wong, Kakeung / Siu, Man-hung:
"Automatic language identification using discrete hidden Markov model",
1633-1636.
Zhou, Bowen / Dechelotte, Daniel / Gao, Yuqing:
"Two-way speech-to-speech translation on handheld devices",
1637-1640.
Blanchon, Hervé:
"HLT modules scalability within the NESPOLE! project",
1641-1644.
Linguistics, Phonology, and Phonetics
Kim, Midam:
"Correlation between VOT and F0 in the perception of Korean stops and affricates",
49-52.
Noiray, Aude / Menard, Lucie / Cathiard, Marie-Agnes / Abry, Christian / Savariaux, Christophe:
"The development of anticipatory labial coarticulation in French: a pioneering study",
53-56.
Hunt, Melvyn John:
"Speech recognition, syllabification and statistical phonetics",
57-60.
Tian, Jilei:
"Data-driven approaches for automatic detection of syllable boundaries",
61-64.
Cutler, Anne / Norris, Dennis / Sebastian-Galles, Nuria:
"Phonemic repertoire and similarity within the vocabulary",
65-68.
Maskey, Sameer / Black, Alan / Tomokiyo, Laura:
"Bootstrapping phonetic lexicons for new languages",
69-72.
Broersma, Mirjam / Kolkman, K. Marieke:
"Lexical representation of non-native phonemes",
1241-1244.
Lee, Jong-Pyo / Jang, Tae-Yeoub:
"A comparative study on the production of inter-stress intervals of English speech by English native speakers and Korean speakers",
1245-1248.
Murano, Emi Zuiki / Teshigawara, Mihoko:
"Articulatory correlates of voice qualities of good guys and bad guys in Japanese anime: an MRI study",
1249-1252.
Dusan, Sorin:
"Effects of phonetic contexts on the duration of phonetic segments in fluent read speech",
1253-1256.
Fang, Qiang:
"A study on nasal coda loss in continuous speech",
1257-1260.
Jian, Hua-Li:
"An improved pair-wise variability index for comparing the timing characteristics of speech",
1261-1264.
Jian, Hua-Li:
"An acoustic study of speech rhythm in Taiwan English",
1265-1268.
Kim, Sung-A:
"Language specific phonetic rules: evidence from domain-initial strengthening",
1269-1272.
Park, Hansang:
"Spectral characteristics of the release bursts in Korean alveolar stops",
1273-1276.
Son, Rob Van / Bolotova, Olga / Pols, Louis C. W. / Lennes, Mietta:
"Frequency effects on vowel reduction in three typologically different languages (Dutch, Finnish, Russian)",
1277-1280.
Abresch, Julia / Breuer, Stefan:
"Assessment of non-native phones in anglicisms by German listeners",
1281-1284.
Kim, Sunhee:
"Phonology of exceptions for Korean grapheme-to-phoneme conversion",
1285-1289.
Kitazawa, Shigeyoshi / Kiriyama, Shinya:
"Acoustic and prosodic analysis of Japanese vowel-vowel hiatus with laryngeal effect",
1289-1293.
Tsukada, Kimiko:
"A cross-linguistic acoustic comparison of unreleased word-final stops: Korean and Thai",
1293-1296.
Cho, Taehong / Johnson, Elizabeth K.:
"Acoustic correlates of phrase-internal lexical boundaries in Dutch",
1297-1300.
Cho, Taehong / McQueen, James M.:
"Phonotactics vs. phonetic cues in native and non-native listening: Dutch and Korean listeners' perception of Dutch and English",
1301-1304.
Kaminskaia, Svetlana / Poire, Francois:
"Comparing intonation of two varieties of French using normalized F0 values",
1305-1308.
Oh, Mira / Kim, Kee-Ho:
"Phonetic realization of the suffix-suppressed accentual phrase in Korean",
1309-1312.
Bunnell, H. Timothy / Polikoff, James / McNicholas, Jane:
"Spectral moment vs. Bark cepstral analysis of children's word-initial voiceless stops",
1313-1316.
Minematsu, Nobuaki:
"Pronunciation assessment based upon the compatibility between a learner's pronunciation structure and the target language's lexical structure",
1317-1320.
Yoshida, Kenji:
"Spread of high tone in Akita Japanese",
1321-1324.
Biomedical Applications of Speech Analysis
Godino-Llorente, Juan-Ignacio / Rodellar-Biarge, Victoria / Gomez-Vilda, Pedro / Diaz-Perez, Francisco / Alvarez-Marquina, Agustin / Martinez-Olalla, Rafael:
"Biomechanical parameter fingerprint in the mucosal wave power spectral density",
73-76.
Jo, Cheolwoo / Wang, Soo-Geon / Yang, Byung-Gon / Kim, Hyung-Soon / Li, Tao:
"Classification of pathological voice including severely noisy cases",
77-80.
Fu, Qiang / Murphy, Peter:
"A robust glottal source model estimation technique",
81-84.
Mori, Hiroki / Kobayashi, Yasunori / Kasuya, Hideki / Hirose, Hajime / Kobayashi, Noriko:
"F0 and formant frequency distribution of dysarthric speech - a comparative study",
85-88.
Kawahara, Hideki / Hirachi, Yumi / Morise, Masanori / Banno, Hideki:
"Procedure "senza vibrato": a key component for morphing singing",
89-92.
Manfredi, Claudia / Peretti, Giorgio / Magnoni, Laura / Dori, Fabrizio / Iadanza, Ernesto:
"Thyroplastic medialisation in unilateral vocal fold paralysis: assessing voice quality recovering",
93-96.
Kubin, Gernot / Hagmueller, Martin:
"Voice enhancement of male speakers with laryngeal neoplasm",
541-544.
Choi, Jong Min / Sung, Myung-Whun / Park, Kwang Suk / Hah, Jeong-Hun:
"A comparison of the perturbation analysis between PRAAT and Computerized Speech Lab",
545-548.
Robust Speech Recognition on AURORA
Ji, Ming / Hou, Baochun:
"Evaluation of universal compensation on Aurora 2 and 3 and beyond",
97-100.
Hamme, Hugo Van:
"PROSPECT features and their application to missing data techniques for robust speech recognition",
101-104.
Hamme, Hugo Van / Wambacq, Patrick / Stouten, Veronique:
"Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement",
105-108.
Hirsch, Hans-Guenter / Finster, Harald:
"Applying the Aurora feature extraction schemes to a phoneme based recognition task",
109-112.
Zhang, Zhipeng / Ohya, Tomoyuki / Furui, Sadaoki:
"Evaluation of tree-structured piecewise linear transformation-based noise adaptation on AURORA2 database",
113-116.
Myrvoll, Tor Andre / Nakamura, Satoshi:
"Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm",
117-120.
Sasou, Akira / Tanaka, Kazuyo / Nakamura, Satoshi / Asano, Futoshi:
"HMM-based feature compensation method: an evaluation using the AURORA2",
121-124.
Wang, Xuechuan / O'Shaughnessy, Douglas:
"Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping",
125-128.
Shannon, Benjamin J. / Paliwal, Kuldip K.:
"MFCC computation from magnitude spectrum of higher lag autocorrelation coefficients for robust speech recognition",
129-132.
Muhammad, Ghulam / Fukuda, Takashi / Horikawa, Junsei / Nitta, Tsuneo:
"A noise-robust feature extraction method based on pitch-synchronous ZCPA for ASR",
133-136.
Segura, José Carlos / Torre, Angel De la / Ramirez, Javier / Rubio, Antonio J. / Benitez, Carmen:
"Including uncertainty of speech observations in robust speech recognition",
137-140.
Yamada, Takeshi / Okada, Jiro / Kitawaki, Nobuhiko:
"Integration of n-best recognition results obtained by multiple noise reduction algorithms",
141-144.
Setiawan, Panji / Stan, Sorel / Fingscheidt, Tim:
"Revisiting some model-based and data-driven denoising algorithms in Aurora 2 context",
145-148.
Ding, Guo-Hong / Xu, Bo:
"Exploring high-performance speech recognition in noisy environments using high-order Taylor series expansion",
149-152.
Au, Wing-Hei / Siu, Man-Hung:
"A robust training algorithm based on neighborhood information",
153-156.
Lee, Siu Wa / Ching, Pak Chung:
"In-phase feature induction: an effective compensation technique for robust speech recognition",
157-160.
Yeung, Siu-Kei Au / Siu, Man-Hung:
"Improved performance of Aurora 4 using HTK and unsupervised MLLR adaptation",
161-164.
Tsai, Shang-nien / Lee, Lin-shan:
"A new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering",
165-168.
Spoken / Multimodal Dialogue System
Fügen, Christian / Holzapfel, Hartwig / Waibel, Alex:
"Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition",
169-172.
Lee, Akinobu / Nakamura, Keisuke / Nisimura, Ryuichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs",
173-176.
Oshikawa, Hironori / Kitaoka, Norihide / Nakagawa, Seiichi:
"Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary",
177-180.
Zitouni, Imed / Lee, Minkyu / Jiang, Hui:
"Constrained minimization technique for topic identification using discriminative training and support vector machines",
181-184.
Williams, Jason D. / Young, Steve:
"Characterizing task-oriented dialog using a simulated ASR channel",
185-188.
Konashi, Takashi / Suzuki, Motoyuki / Ito, Akinori / Makino, Shozo:
"A spoken dialog system based on automatic grammar generation and template-based weighting for autonomous mobile robots",
189-192.
Ito, Akinori / Oba, Takanobu / Konashi, Takashi / Suzuki, Motoyuki / Makino, Shozo:
"Noise adaptive spoken dialog system based on selection of multiple dialog strategies",
193-196.
Hartikainen, Mikko / Turunen, Markku / Hakulinen, Jaakko / Salonen, Esa-Pekka / Funk, J. Adam:
"Flexible dialogue management using distributed and dynamic dialogue control",
197-200.
Houck, Keith:
"Contextual revision in information seeking conversation systems",
201-204.
O'Neill, Ian / Hanna, Philip / Liu, Xingkun / McTear, Michael:
"Cross domain dialogue modelling: an object-based approach",
205-208.
Sagawa, Hirohiko / Mitamura, Teruko / Nyberg, Eric:
"A comparison of confirmation styles for error handling in a speech dialog system",
209-212.
Yang, Fan / Heeman, Peter A.:
"Using computer simulation to compare two models of mixed-initiative",
213-216.
Yang, Fan / Heeman, Peter A. / Hollingshead, Kristy:
"Towards understanding mixed-initiative in task-oriented dialogues",
217-220.
Wolf, Peter / Woelfel, Joseph / Gemert, Jan Van / Raj, Bhiksha / Wong, David:
"Spokenquery: an alternate approach to choosing items with speech",
221-224.
Douglas, Shona / Agarwal, Deepak / Alonso, Tirso / Bell, Robert / Rahim, Mazin / Swayne, Deborah F. / Volinsky, Chris:
"Mining customer care dialogs for "daily news"",
225-228.
Edlund, Jens / Skantze, Gabriel / Carlson, Rolf:
"Higgins - a spoken dialogue system for investigating error handling techniques",
229-232.
Weng, Fuliang / Cavedon, Lawrence / Raghunathan, Badri / Mirkovic, Danilo / Cheng, Hua / Schmidt, Hauke / Bratt, Harry / Mishra, Rohit / Peters, Stanley / Upson, Sandra / Shriberg, Elizabeth / Bergmann, Carsten / Zhao, Lin:
"A conversational dialogue system for cognitively overloaded users",
233-236.
Hanrieder, Gerhard / Hamerich, Stefan W.:
"Modeling generic dialog applications for embedded systems",
237-240.
Stuttle, Matthew N. / Williams, Jason D. / Young, Steve:
"A framework for dialogue data collection with a simulated ASR channel",
241-244.
Pan, Shimei:
"A multi-layer conversation management approach for information seeking applications",
245-248.
Harris, Thomas Kevin / Rosenfeld, Roni:
"A universal speech interface for appliances",
249-252.
Hayashi, Keita / Irie, Yuki / Yamaguchi, Yukiko / Matsubara, Shigeki / Kawaguchi, Nobuo:
"Speech understanding, dialogue management and response generation in corpus-based spoken dialogue system",
253-256.
Fernandez, Fernando / Sama, Valentin / D'Haro, Luis F. / San-Segundo, Ruben / Córdoba, Ricardo de / Montero, Juan Manuel:
"Implementation of dialog applications in an open-source VoiceXML platform",
257-260.
Lau, Chun Wai / Ma, Bin / Meng, Helen Mei-Ling / Moon, Yiu-Sang / Yam, Yeung:
"Fuzzy logic decision fusion in a multimodal biometric system",
261-264.
Poller, Peter / Reithinger, Norbert:
"A state model for the realization of visual perceptive feedback in SmartKom",
265-268.
Iida, Akemi / Ueno, Yoshito / Matsuura, Ryohei / Aikawa, Kiyoaki:
"A vector-based method for efficiently representing multivariate environmental information",
269-272.
Toptsis, Ioannis / Li, Shuyin / Wrede, Britta / Fink, Gernot A.:
"A multi-modal dialog system for a mobile robot",
273-276.
Bernsen, Niels Ole / Dybkjaer, Laila:
"Structured interview-based evaluation of spoken multimodal conversation with H.C. Andersen",
277-280.
Speech Recognition - Search
Novak, Miroslav / Bergl, Vladimir:
"Memory efficient decoding graph compilation with wide cross-word acoustic context",
281-284.
Zhang, Dongbin / Du, Limin:
"Dynamic beam pruning strategy using adaptive control",
285-288.
Hori, Takaaki / Hori, Chiori / Minami, Yasuhiro:
"Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition",
289-292.
Yu, Peng / Seide, Frank Torsten Bernd:
"A hybrid word / phoneme-based approach for improved vocabulary-independent search in spontaneous speech",
293-296.
Smidl, Lubos / Müller, Ludek:
"Keyword spotting for highly inflectional languages",
297-300.
Tendeau, Frédéric:
"Optimizing an engine network that allows dynamic masking",
301-304.
Spoken Dialogue and Systems
Ohtsuki, Katsutoshi / Hiroshima, Nobuaki / Hayashi, Yoshihiko / Bessho, Katsuji / Matsunaga, Shoichi:
"Topic structure extraction for meeting indexing",
305-308.
Rosset, Sophie / Lamel, Lori:
"Automatic detection of dialog acts based on multilevel information",
309-312.
Levow, Gina-Anne:
"Identifying local corrections in human-computer dialogue",
313-316.
Reichl, Peter / Hammer, Florian:
"Hot discussion or frosty dialogue? towards a temperature metric for conversational interactivity",
317-320.
Seneff, Stephanie / Wang, Chao / Hetherington, Lee / Chung, Grace:
"A dynamic vocabulary spoken dialogue interface",
321-324.
Denecke, Matthias / Dohsaka, Kohji / Nakano, Mikio:
"Learning dialogue policies using state aggregation in reinforcement learning",
325-328.
Speech Perception
Shatzman, Keren B.:
"Segmenting ambiguous phrases using phoneme duration",
329-332.
Sakamoto, Shuichi / Suzuki, Yo-iti / Amano, Shigeaki / Kondo, Tadahisa / Iwaoka, Naoki:
"A compensation method for word-familiarity difference with SNR control in intelligibility test",
333-336.
Otake, Takashi / Sakamoto, Yoko / Konomi, Yasuyuki:
"Phoneme-based word activation in spoken-word recognition: evidence from Japanese school children",
337-340.
Brahimi, Belynda / Mareuil, Philippe Boula de / Gendrot, Cedric:
"Role of segmental and suprasegmental cues in the perception of Maghrebian-accented French",
341-344.
Kato, Hiroaki / Sagisaka, Yoshinori / Tsuzaki, Minoru / Muto, Makiko:
"Effect of speaking rate on the acceptability of change in segment duration",
345-348.
Yoneyama, Kiyoko:
"A cross-linguistic study of diphthongs in spoken word processing in Japanese and English",
349-352.
Multi-Lingual Speech-to-Speech Translation
Waibel, Alex:
"Speech translation: past, present and future",
353-356.
Kikui, Genichiro / Takezawa, Toshiyuki / Yamamoto, Seiichi:
"Multilingual corpora for speech-to-speech translation research",
357-360.
Ney, Hermann:
"Statistical machine translation and its challenges",
361-364.
Lee, John / Seneff, Stephanie:
"Translingual grammar induction",
365-368.
Lee, Youngjik / Park, Jun / Oh, Seung-Shin:
"Usability considerations of speech-to-speech translation system",
369-372.
Lazzari, Gianni / Waibel, Alex / Zong, Chengqing:
"Worldwide ongoing activities on multilingual speech to speech translation",
373-376.
Speech Recognition - Large Vocabulary
Fohr, Dominique / Mella, Odile / Cerisara, Christophe / Illina, Irina:
"The automatic news transcription system: ANTS, some real time experiments",
377-380.
Ramabhadran, Bhuvana / Siohan, Olivier / Zweig, Geoffrey:
"Use of metadata to improve recognition of spontaneous speech and named entities",
381-384.
Pylkkonen, Janne / Kurimo, Mikko:
"Duration modeling techniques for continuous speech recognition",
385-388.
Alumae, Tanel:
"Large vocabulary continuous speech recognition for Estonian using morpheme classes",
389-392.
Han, Zhaobing / Zhang, Shuwu / Xu, Bo:
"Combining agglomerative and tree-based state clustering for high accuracy acoustic modeling",
393-396.
Wang, William S-Y. / Peng, Gang:
"Parallel tone score association method for tone language speech recognition",
397-400.
Zheng, Jing / Franco, Horacio / Stolcke, Andreas:
"Effective acoustic modeling for rate-of-speech variation in large vocabulary conversational speech recognition",
401-404.
Ghadiyaram, G.L. Sarada / Nagarajan, N. Hemalatha / Thangavelu, T. Nagarajan / Murthy, Hema A.:
"Automatic transcription of continuous speech using unsupervised and incremental training",
405-408.
Nouza, Jan / Nejedlova, Dana / Zdansky, Jindrich / Kolorenc, Jan:
"Very large vocabulary speech recognition system for automatic transcription of Czech broadcast programs",
409-412.
Siohan, Olivier / Ramabhadran, Bhuvana / Zweig, Geoffrey:
"Speech recognition error analysis on the English MALACH corpus",
413-416.
Zhang, Rong / Rudnicky, Alexander:
"A frame level boosting training scheme for acoustic modeling",
417-420.
Zhang, Rong / Rudnicky, Alexander:
"Optimizing boosting with discriminative criteria",
421-424.
Xu, Xianghua / Guo, Qiang / Zhu, Jie:
"Restructuring HMM states for speaker adaptation in Mandarin speech recognition",
425-428.
Matton, Mike / Wachter, Mathias De / Compernolle, Dirk Van / Cools, Ronald:
"A discriminative locally weighted distance measure for speaker independent template based speech recognition",
429-432.
Itaya, Yohei / Zen, Heiga / Nankaku, Yoshihiko / Miyajima, Chiyomi / Tokuda, Keiichi / Kitamura, Tadashi:
"Deterministic annealing EM algorithm in parameter estimation for acoustic model",
433-436.
Grezl, Frantisek / Karafiat, Martin / Cernocky, Jan:
"TRAP based features for LVCSR of meeting data",
437-440.
Soong, Frank K. / Lo, Wai Kit / Nakamura, Satoshi:
"Optimal acoustic and language model weights for minimizing word verification errors",
441-444.
Sako, Atsushi / Ariki, Yasuo:
"Structuring of baseball live games based on speech recognition using task dependent knowledge",
445-448.
Zhou, Zhengyu / Meng, Helen:
"A two-level schema for detecting recognition errors",
449-452.
Choi, In-Jeong / Kim, Nam-Hoon / Yoon, Su Youn:
"Large vocabulary continuous speech recognition based on cross-morpheme phonetic information",
453-456.
Ma, Changxue:
"Automatic phonetic base form generation based on maximum context tree",
457-460.
Hernandez-Abrego, Gustavo / Olorenshaw, Lex / Tato, Raquel / Schaaf, Thomas:
"Dictionary refinements based on phonetic consensus and non-uniform pronunciation reduction",
1697-1700.
Messaoudi, Abdel / Lamel, Lori / Gauvain, Jean-Luc:
"Transcription of Arabic broadcast news",
1701-1704.
Shinozaki, Takahiro / Furui, Sadaoki:
"Spontaneous speech recognition using a massively parallel decoder",
1705-1708.
Schultz, Tanja / Jin, Qin / Laskowski, Kornel / Pan, Yue / Metze, Florian / Fügen, Christian:
"Issues in meeting transcription - the ISL meeting transcription system",
1709-1712.
Ohtsuki, Katsutoshi / Hiroshima, Nobuaki / Matsunaga, Shoichi / Hayashi, Yoshihiko:
"Multi-pass ASR using vocabulary expansion",
1713-1716.
Doumpiotis, Vlasios / Byrne, William:
"Pinched lattice minimum Bayes risk discriminative training for large vocabulary continuous speech recognition",
1717-1720.
Shafran, Izhak / Byrne, William:
"Task-specific minimum Bayes-risk decoding using learned edit distance",
1945-1948.
Zhang, Rong / Rudnicky, Alexander:
"Apply n-best list re-ranking to acoustic model combinations of boosting training",
1949-1952.
Kim, D. Y. / Umesh, S. / Gales, M. J. F. / Hain, T. / Woodland, P. C.:
"Using VTLN for broadcast news transcription",
1953-1956.
Stolcke, Andreas / Wooters, Chuck / Bulyko, Ivan / Graciarena, Martin / Otterson, Scott / Peskin, Barbara / Ostendorf, Mari / Gelbart, Dave / Mirghafori, Nikki / Pirinen, Tuomo:
"From Switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system",
1957-1960.
Venkataraman, Anand / Stolcke, Andreas / Wang, Wen / Vergyri, Dimitra / Zheng, Jing / Gadde, Venkata Ramana Rao:
"An efficient repair procedure for quick transcriptions",
1961-1964.
Qian, Yao / Lee, Tan / Soong, Frank K.:
"Tone information as a confidence measure for improving Cantonese LVCSR",
1965-1968.
Speech Science
Due, Danielle:
"Temporal variables in parkinsonian speech",
461-464.
Engwall, Olov:
"Speaker adaptation of a three-dimensional tongue model",
465-468.
Cooper, Nicole / Cutler, Anne:
"Perception of non-native phonemes in noise",
469-472.
Kawahara, Hideki / Banno, Hideki / Irino, Toshio / Jin, Jiang:
"Intelligibility of degraded speech from smeared STRAIGHT spectrum",
473-476.
Kim, Young-Ik / Kil, Rhee Man:
"Sound source localization based on zero-crossing peak-amplitude coding",
477-480.
Kajikawa, Sachiyo / Fais, Laurel / Amano, Shigeaki / Werker, Janet:
"Adult and infant sensitivity to phonotactic features in spoken Japanese",
481-484.
Green, Phil / Carmichael, James:
"Revisiting dysarthria assessment intelligibility metrics",
485-488.
Ciocca, Valter / Whitehill, Tara L. / Ma, Joan K.-Y.:
"The effect of intonation on perception of Cantonese lexical tones",
489-492.
Isei-Jaakkola, Toshiko:
"Maximum short quantity in Japanese and Finnish in two perception tests with F0 and dB variants",
493-496.
Alku, Paavo / Airas, Matti / Story, Brad:
"Evaluation of an inverse filtering technique using physical modeling of voice production",
497-500.
Hsu, Hui-ju / Fon, Janice:
"Positional and phonotactic effects on the realization of Taiwan Mandarin tone 2",
501-504.
Schnell, Karl / Lacroix, Arild:
"Speech production based on lossy tube models: unit concatenation and sound transitions",
505-508.
Yan, Qin / Vaseghi, Saeed / Rentzos, Dimitrios / Ho, Ching-Hsiang:
"Modelling and ranking of differences across formants of British, Australian and American accents",
509-512.
Kitamura, Tatsuya / Fujita, Satoru / Honda, Kiyoshi / Nishimoto, Hironori:
"An experimental method for measuring transfer functions of acoustic tubes",
513-516.
Tsuji, Takuya / Kaburagi, Tokihiko / Wakamiya, Kohei / Kim, Jiji:
"Estimation of the vocal tract spectrum from articulatory movements using phoneme-dependent neural networks",
517-520.
Motoki, Kunitoshi / Matsuzaki, Hiroki:
"Computation of the acoustic characteristics of vocal-tract models with geometrical perturbation",
521-524.
Vijayalakshmi, P. / Reddy, M. Ramasubba:
"Analysis of hypernasality by synthesis",
525-528.
Kacha, Abdellah / Grenez, Francis / Bettens, Frédéric / Schoentgen, Jean:
"Adaptive long-term predictive analysis of disordered speech",
529-532.
Jovicic, Slobodan / Antesevic, Sandra / Saric, Zoran:
"Phoneme restoration in degraded speech communication",
533-536.
Marinaki, Maria / Kotropoulos, Constantine / Pitas, Ioannis / Maglaveras, Nikolaos:
"Automatic detection of vocal fold paralysis and edema",
537-540.
Novel Features in ASR
Minami, Yasuhiro / McDermott, Erik / Nakamura, Atsushi / Katagiri, Shigeru:
"A theoretical analysis of speech recognition based on feature trajectory models",
549-552.
Ou, Zhijian / Wang, Zuoying:
"Discriminative combination of multiple linear predictions for speech recognition",
553-556.
Gharavian, Davood / Ahadi, Mohammad:
"Use of formants in stressed and unstressed continuous speech recognition",
557-560.
Markov, Konstantin / Nakamura, Satoshi / Dang, Jianwu:
"Integration of articulatory dynamic parameters in HMM/BN based speech recognition system",
561-564.
Alsteris, Leigh David / Paliwal, Kuldip K.:
"ASR on speech reconstructed from short-time Fourier phase spectra",
565-568.
Spoken and Natural Language Understanding
Lieb, Robert / Fabian, Tibor / Ruske, Guenther / Thomae, Matthias:
"Estimation of semantic confidences on lattice hierarchies",
569-572.
Fukumoto, Fumiyo / Suzuki, Yoshimi:
"Learning subject drift for topic tracking",
573-576.
Shriberg, Elizabeth / Stolcke, Andreas / Hillard, Dustin / Ostendorf, Mari / Peskin, Barbara / Harper, Mary / Liu, Yang:
"The ICSI-SRI-UW metadata extraction system",
577-580.
Hasegawa-Johnson, Mark / Levinson, Stephen / Zhang, Tong:
"Automatic detection of contrast for speech understanding",
581-584.
Wang, Nick Jui-Chang / Shen, Jia-Lin / Tsai, Ching-Ho:
"Integrating layer concept information into n-gram modeling for spoken language understanding",
585-588.
Chen, Junyan / Wu, Ji / Wang, Zuoying:
"A robust understanding model for spoken dialogues",
589-592.
Wutiwiwatchai, Chai / Furui, Sadaoki:
"Belief-based nonlinear rescoring in Thai speech understanding",
2129-2132.
Itoh, Toshihiko / Kai, Atsuhiko / Itoh, Yukihiro / Konishi, Tatsuhiro:
"An understanding strategy based on plausibility score in recognition history using CSR confidence measure",
2133-2136.
Jung, Sangkeun / Jeong, Minwoo / Lee, Gary Geunbae:
"Speech recognition error correction using maximum entropy language model",
2137-2140.
Li, Xiang / Huerta, Juan:
"Discriminative training of compound-word based multinomial classifiers for speech routing",
2141-2144.
Eun, Jihyun / Lee, Changki / Lee, Gary Geunbae:
"An information extraction approach for spoken language understanding",
2145-2148.
Horowitz, David / Lal, Partha / Buckley, Pierce Gerard:
"A maximum entropy shallow functional parser for spoken language understanding",
2149-2152.
Huang, Qiang / Cox, Stephen:
"Mixture language models for call routing",
2153-2156.
Wu, Chung-Hsien / Yeh, Jui-Feng / Chen, Ming-Jun:
"Speech act identification using an ontology-based partial pattern tree",
2157-2160.
Wang, Ye-Yi / Ju, Yun-Cheng:
"Creating speech recognition grammars from regular expressions for alphanumeric concepts",
2161-2164.
Trancoso, Isabel / Araujo, Paulo / Viana, Ceu / Mamede, Nuno:
"Poetry assistant",
2165-2168.
Kitade, Tasuku / Kawahara, Tatsuya / Nanjo, Hiroaki:
"Automatic extraction of key sentences from oral presentations using statistical measure based on discourse markers",
2169-2172.
Ohno, Tomohiro / Matsubara, Shigeki / Kawaguchi, Nobuo / Inagaki, Yasuyoshi:
"Robust dependency parsing of spontaneous Japanese speech and its evaluation",
2173-2176.
Minker, Wolfgang / Buehler, Dirk / Beuschel, Christiane:
"Strategies for optimizing a stochastic spoken natural language parser",
2177-2180.
Lee, Tzu-Lun / He, Ya-Fang / Huang, Yun-Ju / Tseng, Shu-Chuan / Eklund, Robert:
"Prolongation in spontaneous Mandarin",
2181-2184.
Irie, Yuki / Matsubara, Shigeki / Kawaguchi, Nobuo / Yamaguchi, Yukiko / Inagaki, Yasuyoshi:
"Speech intention understanding based on decision tree learning",
2185-2188.
Banerjee, Satanjeev / Rudnicky, Alexander:
"Using simple speech-based features to detect the state of a meeting and the roles of the meeting participants",
2189-2192.
Yildirim, Serdar / Bulut, Murtaza / Lee, Chul Min / Kazemzadeh, Abe / Deng, Zhigang / Lee, Sungbok / Narayanan, Shrikanth / Busso, Carlos:
"An acoustic study of emotions expressed in speech",
2193-2196.
Kawahara, Tatsuya / Lane, Ian Richard / Matsui, Tomoko / Nakamura, Satoshi:
"Topic classification and verification modeling for out-of-domain utterance detection",
2197-2200.
Park, So-Young / Kwak, Yong-Jae / Lim, Joon-Ho / Rim, Hae-Chang / Kim, Soo-Hong:
"Partially lexicalized parsing model utilizing rich features",
2201-2204.
Suzuki, Yoshimi / Fukumoto, Fumiyo / Sekiguchi, Yoshihiro:
"Clustering similar nouns for selecting related news articles",
2205-2208.
Badino, Leonardo:
"Chinese text word-segmentation considering semantic links among sentences",
2209-2212.
Lee, Do-Gil / Rim, Hae-Chang:
"Syllable-based probabilistic morphological analysis model of Korean",
2213-2216.
Speaker Segmentation and Clustering
Valente, Fabio / Wellekens, Christian:
"Scoring unknown speaker clustering: VB vs. BIC",
593-596.
Jin, Qin / Schultz, Tanja:
"Speaker segmentation and clustering in meetings",
597-600.
Lamel, Lori / Gauvain, Jean-Luc / Canseco-Rodriguez, Leonardo:
"Speaker diarization from speech transcripts",
601-604.
Miro, Xavier Anguera / Pericas, Javier Hernando:
"Evolutive speaker segmentation using a repository system",
605-608.
Aronowitz, Hagai / Burshtein, David / Amir, Amihood:
"Speaker indexing in audio archives using test utterance Gaussian mixture modeling",
609-612.
Raux, Antoine:
"Automated lexical adaptation and speaker clustering based on pronunciation habits for non-native speech recognition",
613-616.
Speech Processing in a Packet Network Environment
Paliwal, Kuldip K. / So, Stephen:
"Scalable distributed speech recognition using multi-frame GMM-based block quantization",
617-620.
Srinivasamurthy, Naveen / Han, Kyu Jeong / Narayanan, Shrikanth:
"Robust speech recognition over packet networks: an overview",
621-624.
Eriksson, Thomas / Kim, Samuel / Kang, Hong-Goo / Lee, Chungyong:
"Theory for speaker recognition over IP",
625-628.
Chou, Wu / Liu, Feng:
"Voice portal services in packet network and VoIP environment",
629-632.
Kabal, Peter / Elliott, Colm:
"Synchronization of speaker selection for centralized tandem free VoIP conferencing",
633-636.
Kataoka, Akitoshi / Hiwasaki, Yusuke / Morinaga, Toru / Ikedo, Jotaro:
"Measuring the perceived importance of time- and frequency-divided speech blocks for transmitting over packet networks",
637-640.
Kim, Moo Young / Kleijn, W. Bastiaan:
"Comparison of transmitter-based packet-loss recovery techniques for voice transmission",
641-644.
Acoustic Modeling
Jouvet, Denis / Messina, Ronaldo:
"Context dependent "long units" for speech recognition",
645-648.
Yoshizawa, Shinichi / Shikano, Kiyohiro:
"Rapid EM training based on model-integration",
649-652.
Fohr, Dominique / Mella, Odile / Illina, Irina / Cerisara, Christophe:
"Experiments on the accuracy of phone models and liaison processing in a French broadcast news transcription system",
653-656.
Silva, Jorge / Narayanan, Shrikanth:
"A statistical discrimination measure for hidden Markov models based on divergence",
657-660.
Stadermann, Jan / Rigoll, Gerhard:
"A hybrid SVM/HMM acoustic modeling approach to automatic speech recognition",
661-664.
Knoblauch, Dirk:
"Data driven number-of-states selection in HMM topologies",
665-668.
Cho, Youngkyu / Kim, Sung-a / Yook, Dongsuk:
"Hybrid model using subspace distribution clustering hidden Markov models and semi-continuous hidden Markov models for embedded speech recognizers",
669-672.
Olsen, Peder / Visweswariah, Karthik:
"Fast clustering of Gaussians and the virtue of representing Gaussians in exponential model format",
673-676.
Livescu, Karen / Glass, James:
"Feature-based pronunciation modeling with trainable asynchrony probabilities",
677-680.
Kuo, Hong-Kwang Jeff / Gao, Yuqing:
"Maximum entropy direct model as a unified model for acoustic modeling in speech recognition",
681-684.
Zhu, Yu / Lee, Tan:
"Explicit duration modeling for Cantonese connected-digit recognition",
685-688.
Chan, Arthur / Mosur, Ravishankar / Rudnicky, Alexander / Sherwani, Jahanzeb:
"Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems",
689-692.
Park, Junho / Ko, Hanseok:
"Compact acoustic model for embedded implementation",
693-696.
Jitsuhiro, Takatoshi / Nakamura, Satoshi:
"Increasing the mixture components of non-uniform HMM structures based on a variational Bayesian approach",
697-700.
Somervuo, Panu Juhani:
"Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition",
701-704.
Macherey, Wolfgang / Schlüter, Ralf / Ney, Hermann:
"Discriminative training with tied covariance matrices",
705-708.
Diehl, Frank / Moreno, Asuncion:
"Acoustic phonetic modeling using local codebook features",
709-712.
Jung, Gue Jun / Kim, Su-Hyun / Oh, Yung-Hwan:
"An efficient codebook design in SDCHMM for mobile communication environments",
713-716.
Shozakai, Makoto / Nagino, Goshu:
"Analysis of speaking styles by two-dimensional visualization of aggregate of acoustic models",
717-720.
Koo, Myoung-Wan / Jeon, Ho-Hyun / Lee, Sang-Hong:
"Context dependent phoneme duration modeling with tree-based state tying",
721-724.
Bridle, John Scott:
"Towards better understanding of the model implied by the use of dynamic features in HMMs",
725-728.
Prosody Modeling and Generation
Li, Jian-Feng / Hu, Guo-Ping / Wang, Renhua:
"Chinese prosody phrase break prediction based on maximum entropy model",
729-732.
Sreenivasa Rao, Krothapalli / Yegnanarayana, Bayya:
"Intonation modeling for Indian languages",
733-736.
Zheng, Yu / Lee, Gary Geunbae / Kim, Byeongchang:
"Using multiple linguistic features for Mandarin phrase break prediction in maximum-entropy classification framework",
737-740.
Read, Ian / Cox, Stephen:
"Using part-of-speech for predicting phrase breaks",
741-744.
Escudero-Mancebo, David / Cardenoso-Payo, Valentin:
"A proposal to quantitatively select the right intonation unit in data-driven intonation modeling",
745-748.
Ni, Jinfu / Kawai, Hisashi / Hirose, Keikichi:
"Formulating contextual tonal variations in Mandarin",
749-752.
Mouline, Salma / Boeffard, Olivier / Bagshaw, Paul:
"Automatic adaptation of the MOMEL F0 stylisation algorithm to new corpora",
753-756.
Aguero, Pablo Daniel / Wimmer, Klaus / Bonafonte, Antonio:
"Joint extraction and prediction of Fujisaki's intonation model parameters",
757-760.
Zervas, Panagiotis / Fakotakis, Nikos / Kokkinakis, George / Kouroupetroglou, George / Xydas, Gerasimos:
"Evaluation of corpus based tone prediction in mismatched environments for Greek TTS synthesis",
761-764.
Xiong, Ziyu / Chen, Juanwen:
"The duration of pitch transition phase and its relative factors",
765-768.
Hu, Yu / Wang, Renhua / Sun, Lu:
"Polynomial regression model for duration prediction in Mandarin",
769-772.
Tooher, Michelle / McKenna, John G.:
"Prediction of the glottal LF parameters using regression trees",
773-776.
Dellwo, Volker / Aschenberner, Bianca / Wagner, Petra / Dancovicova, Jana / Steiner, Ingmar:
"Bonntempo-corpus and bonntempo-tools: a database for the study of speech rhythm and rate",
777-780.
Gu, Wentao / Hirose, Keikichi / Fujisaki, Hiroya:
"Analysis of F0 contours of Cantonese utterances based on the command-response model",
781-784.
Dohen, Marion / Loevenbruck, Helene:
"Pre-focal rephrasing, focal enhancement and postfocal deaccentuation in French",
785-788.
Nemala, Sridhar Krishna / Talukdar, Partha Pratim / Bali, Kalika / Ramakrishnan, A. G.:
"Duration modeling for Hindi text-to-speech synthesis system",
789-792.
Krishna, Nemala Sridhar / Murthy, Hema A.:
"A new prosodic phrasing model for Indian language Telugu",
793-796.
Jokisch, Oliver / Hofmann, Michael:
"Evolutionary optimization of an adaptive prosody model",
797-800.
Xydas, Gerasimos / Kouroupetroglou, Georgios:
"An intonation model for embedded devices based on natural F0 samples",
801-804.
Vesela, Katerina / Peterek, Nino / Hajicova, Eva:
"Prosodic characteristics of Czech contrastive topic",
805-808.
Multi-Sensor ASR
Graciarena, Martin / Cesari, Federico / Franco, Horacio / Myers, Greg / Cowan, Cregg / Abrash, Victor:
"Combination of standard and throat microphones for robust speech recognition in highly noisy environments",
809-812.
Demiroglu, Cenk / Anderson, David V.:
"Noise robust digit recognition using a glottal radar sensor for voicing detection",
813-816.
Raub, Dominik / McDonough, John / Wölfel, Matthias:
"A cepstral domain maximum likelihood beamformer for speech recognition",
817-820.
Mochiki, Naoya / Kobayashi, Tetsunori / Sekiya, Toshiyuki / Ogawa, Tetsuji:
"Recognition of three simultaneous utterance of speech by four-line directivity microphone mounted on head of robot",
821-824.
Sagayama, Shigeki / Okajima, Takashi / Kamamoto, Yutaka / Nishimoto, Takuya:
"Complex spectrum circle centroid for microphone-array-based noisy speech recognition",
825-828.
Heck, Larry / Mao, Mark:
"Automatic speech recognition of co-channel speech: integrated speaker and speech recognition approach",
829-832.
Multi-Lingual Speech Processing
Marino, José B. / Moreno, Asuncion / Nogueiras, Albino:
"A first experience on multilingual acoustic modeling of the languages spoken in Morocco",
833-836.
Caballero, Monica / Moreno, Asuncion / Nogueiras, Albino:
"Data driven multidialectal phone set for Spanish dialects",
837-840.
Oria, Daniela / Vetek, Akos:
"Multilingual e-mail text processing for speech synthesis",
841-844.
Romsdorfer, Harald / Pfister, Beat:
"Multi-context rules for phonological processing in polyglot TTS synthesis",
845-848.
Badino, Leonardo / Barolo, Claudia / Quazza, Silvia:
"A general approach to TTS reading of mixed-language texts",
849-852.
Georgiou, Panayiotis G. / Narayanan, Shrikanth S. / Mehr, Hooman Shirani:
"Context dependent statistical augmentation of Persian transcripts",
853-856.
Speech Enhancement
Demiroglu, Cenk / Anderson, David V.:
"A soft decision MMSE amplitude estimator as a noise preprocessor to speech coders using a glottal sensor",
857-860.
Hu, Rongqiang / Anderson, David V.:
"Single acoustic-channel speech enhancement based on glottal correlation using non-acoustic sensor",
861-864.
Zhang, Xianxian / Hansen, John H. L. / Arehart, Kathryn / Rossi-Katz, Jessica:
"In-vehicle based speech processing for hearing impaired subjects",
865-868.
Srinivasan, Sriram / Kleijn, W. Bastiaan:
"Speech enhancement using adaptive time-domain segmentation",
869-872.
Nakatani, Tomohiro / Kinoshita, Keisuke / Miyoshi, Masato / Zolfaghari, Parham S.:
"Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window",
873-876.
Delcroix, Marc / Hikichi, Takafumi / Miyoshi, Masato:
"Dereverberation of speech signals based on linear prediction",
877-880.
Speech and Affect
Campbell, Nick:
"Perception of affect in speech - towards an automatic processing of paralinguistic information in spoken conversation",
881-884.
Chateau, Noel / Maffiolo, Valerie / Blouin, Christophe:
"Analysis of emotional speech in voice mail messages: the influence of speakers' gender",
885-888.
Lee, Chul Min / Yildirim, Serdar / Bulut, Murtaza / Kazemzadeh, Abe / Busso, Carlos / Deng, Zhigang / Lee, Sungbok / Narayanan, Shrikanth:
"Emotion recognition based on phoneme classes",
889-892.
Robinson, Peter / Shikler, Tal Sobol:
"Visualizing dynamic features of expressions in speech",
893-896.
Li, Aijun / Wang, Haibo:
"Friendly speech analysis and perception in standard Chinese",
897-900.
Ní Chasaide, Ailbhe / Gobl, Christer:
"Decomposing linguistic and affective components of phonatory quality",
901-904.
Jiang, Dan-Ning / Cai, Lian-Hong:
"Classifying emotion in Chinese speech by decomposing prosodic features",
1325-1328.
Yu, Chen / Aoki, Paul / Woodruff, Allison:
"Detecting user engagement in everyday conversations",
1329-1332.
Fujisawa, Takashi / Cook, Norman D.:
"Identifying emotion in speech prosody using acoustical cues of harmony",
1333-1336.
Tao, Jianhua:
"Context based emotion detection from text input",
1337-1340.
Iwai, Atsushi / Yano, Yoshikazu / Okuma, Shigeru:
"Complex emotion recognition system for a specific user using SOM based on prosodic features",
1341-1344.
Cho, Hoon-Young / Yao, Kaisheng / Lee, Te-Won:
"Emotion verification for emotion detection and unknown emotion rejection",
1345-1348.
Hirose, Keikichi:
"Improvement in corpus-based generation of F0 contours using generation process model for emotional speech synthesis",
1349-1352.
Speech Features
Hegde, Rajesh Mahanand / Murthy, Hema A. / Gadde, Venkata Ramana Rao:
"Continuous speech recognition using joint features derived from the modified group delay function and MFCC",
905-908.
Yu, Hua:
"Phase-space representation of speech",
909-912.
Murthy, Hema A. / Hegde, Rajesh Mahanand / Gadde, Venkata Ramana Rao:
"The modified group delay feature: a new spectral representation of speech",
913-916.
Kwon, Oh-Wook / Lee, Te-Won:
"ICA-based feature extraction for phoneme recognition",
917-920.
Zhu, Qifeng / Chen, Barry / Morgan, Nelson / Stolcke, Andreas:
"On using MLP features in LVCSR",
921-924.
Chen, Barry / Zhu, Qifeng / Morgan, Nelson:
"Learning long-term temporal features in LVCSR using neural networks",
925-928.
Sreenivas, T. V. / Kiran, G. V. / Krishna, A. G.:
"Neural "spike rate spectrum" as a noise robust, speaker invariant feature for automatic speech recognition",
929-932.
Nakatoh, Yoshihisa / Nishizaki, Makoto / Yoshizawa, Shinichi / Yamada, Maki:
"An adaptive MEL-LPC analysis for speech recognition",
933-936.
Ishizuka, Kentaro / Miyazaki, Noboru / Nakatani, Tomohiro / Minami, Yasuhiro:
"Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition",
937-940.
Ishi, Carlos Toshinori:
"A new acoustic measure for aspiration noise detection",
941-944.
Demuynck, Kris / Garcia, Oscar / Compernolle, Dirk Van:
"Synthesizing speech from speech recognition parameters",
945-948.
Athineos, Marios / Hermansky, Hynek / Ellis, Daniel P.W.:
"LP-TRAP: linear predictive temporal patterns",
949-952.
Li, Xiang / Stern, Richard:
"Parallel feature generation based on maximizing normalized acoustic likelihood",
953-956.
Wang, Kun-Ching:
"An adaptive band-partitioning spectral entropy based speech detection in realistic noisy environments",
957-960.
Ramirez, Javier / Segura, José Carlos / Benitez, Carmen / Torre, Angel de la / Rubio, Antonio:
"Improved voice activity detection combining noise reduction and subband divergence measures",
961-964.
Park, Kiyoung / Choi, Changkyu / Kim, Jeongsu:
"Voice activity detection using global soft decision with mixture of Gaussian model",
965-968.
Kemp, Thomas / Nadeu, Climent / Lam, Yin Hay / Caros, Josep Maria Sola i:
"Environmental robust features for speech detection",
969-972.
Laskowski, Kornel / Jin, Qin / Schultz, Tanja:
"Crosscorrelation-based multispeaker speech activity detection",
973-976.
Tsai, Shang-nien:
"Improved robustness of time-frequency principal components (TFPC) by synergy of methods in different domains",
977-980.
Deng, Li / Yu, Dong / Acero, Alex:
"A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech",
981-984.
Kubin, Gernot / Pham, Van Tuan:
"DWT-based classification of acoustic-phonetic classes and phonetic units",
985-988.
Cho, Yong-Choon / Choi, Seungjin:
"Learning nonnegative features of spectro-temporal sounds for classification",
989-992.
Language Modeling, Multimodal & Multilingual Speech Processing
Chung, Sungyup / Hirose, Keikichi / Minematsu, Nobuaki:
"N-gram language modeling of Japanese using bunsetsu boundaries",
993-996.
Chen, Langzhou / Lamel, Lori / Gauvain, Jean-Luc / Adda, Gilles:
"Dynamic language modeling for broadcast news",
997-1000.
Lyu, Ren-Yuan / Lyu, Dau-Cheng / Liang, Min-Siong / Wang, Min-Hong / Chiang, Yuang-Chin / Hsu, Chun-Nan:
"A unified framework for large vocabulary speech recognition of mutually unintelligible Chinese "regionalects"",
1001-1004.
Sluis, Ielka van der / Krahmer, Emiel:
"The influence of target size and distance on the production of speech and gesture in multimodal referring expressions",
1005-1008.
Gupta, Anurag Kumar / Anastasakos, Tasos:
"Dynamic time windows for multimodal input fusion",
1009-1012.
Lee, Raymond H. / Gupta, Anurag Kumar:
"MICot: a tool for multimodal input data collection",
1013-1016.
Tadj, Chakib / Djenidi, Hicham / Haouani, Madjid / Ramdane-Cherif, Amar / Levy, Nicole:
"Simulating multimodal applications",
1017-1020.
Pedersen, Jakob Schou / Dalsgaard, Paul / Lindberg, Borge:
"A multimodal communication aid for global aphasia patients",
1021-1024.
Yamamoto, Hirofumi / Kikui, Genichiro / Sagisaka, Yoshinori:
"Mis-recognized utterance detection using hierarchical language model",
1025-1028.
Moberg, Marko / Parssinen, Kimmo / Iso-Sipila, Juha:
"Cross-lingual phoneme mapping for multilingual synthesis systems",
1029-1032.
Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G. / Tasaki, Tsuyoshi / Yamaguchi, Takeshi:
"Robot motion control using listener's back-channels and head gesture information",
1033-1036.
Sakti, Sakriani / Arman, Arry Akhmad / Nakamura, Satoshi / Hutagaol, Paulus:
"Indonesian speech recognition for hearing and speaking impaired people",
1037-1040.
Rashwan, Mohsen:
"A two phase Arabic language model for speech recognition and other language applications",
1041-1044.
Akita, Yuya / Kawahara, Tatsuya:
"Language model adaptation based on PLSA of topics and speakers",
1045-1048.
Dolfing, Hans J. G. A. / Buckley, Pierce Gerard / Horowitz, David:
"Unified language modeling using finite-state transducers with first applications",
1049-1052.
Itou, Katsunobu / Fujii, Atsushi / Akiba, Tomoyosi:
"Effects of language modeling on speech-driven question answering",
1053-1056.
Sethy, Abhinav / Narayanan, Shrikanth / Ramabhadran, Bhuvana:
"Measuring convergence in language model estimation using relative entropy",
1057-1060.
Detection and Classification in ASR
Huang, Rongqing / Hansen, John H. L.:
"High-level feature weighted GMM network for audio stream classification",
1061-1064.
Zdansky, Jindrich / David, Petr / Nouza, Jan:
"An improved preprocessor for the automatic transcription of broadcast news audio stream",
1065-1068.
Wang, Yih-Ru / Huang, Chi-Han:
"Speaker-and-environment change detection in broadcast news using the common component GMM-based divergence measure",
1069-1072.
Lahti, Tommi:
"Beginning of utterance detection algorithm for low complexity ASR engines",
1073-1076.
Sukittanon, Somsak / Surendran, Arun C. / Platt, John C. / Burges, Chris J.C.:
"Convolutional networks for speech detection",
1077-1080.
Gangashetty, Suryakanth V. / Chandra Sekhar, Chellu / Yegnanarayana, Bayya:
"Detection of vowel onset points in continuous speech using autoassociative neural network models",
1081-1084.
Speech Analysis
Tamiya, Toshiki / Shimamura, Tetsuya:
"Reconstruction filter design for bone-conducted speech",
1085-1088.
Quintana-Morales, Pedro J. / Navarro-Mesa, Juan L.:
"Frequency warped ARMA analysis of the closed and the open phase of voiced speech",
1089-1092.
Doval, Boris / Bozkurt, Baris / D'Alessandro, Christophe / Dutoit, Thierry:
"Zeros of z-transform (ZZT) decomposition of speech for source-tract separation",
1093-1096.
Deng, Li / Togneri, Roberto:
"Use of neural network mapping and extended Kalman filter to recover vocal tract resonances from the MFCC parameters of speech",
1097-1100.
Li, Xiao / Malkin, Jonathan / Bilmes, Jeff:
"Graphical model approach to pitch tracking",
1101-1104.
Xu, Bo / Tao, Jianhua / Kang, Yongguo:
"A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation",
1105-1108.
Laprie, Yves:
"A concurrent curve strategy for formant tracking",
2405-2408.
Yan, Qin / Zavarehei, Esfandiar / Vaseghi, Saeed / Rentzos, Dimitrios:
"A formant tracking LP model for speech processing",
2409-2412.
You, Hong:
"Application of long-term filtering to formant estimation",
2413-2416.
Bozkurt, Baris / Dutoit, Thierry / Doval, Boris / D'Alessandro, Christophe:
"A method for glottal formant frequency estimation",
2417-2420.
Bozkurt, Baris / Dutoit, Thierry / Doval, Boris / D'Alessandro, Christophe:
"Improved differential phase spectrum processing for formant tracking",
2421-2424.
Shao, Xu / Milner, Ben P.:
"MAP prediction of pitch from MFCC vectors for speech reconstruction",
2425-2428.
Yu, An-Tze / Wang, Hsiao-Chuan:
"New harmonicity measures for pitch estimation and voice activity detection",
2429-2432.
Nishimoto, Takuya / Sagayama, Shigeki / Kameoka, Hirokazu:
"Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear Kalman filtering",
2433-2436.
Ferencz, Attila / Kim, Jeongsu / Lee, Yong-Beom / Lee, Jae-Won:
"Automatic pitch marking and reconstruction of glottal closure instants from noisy and deformed electro-glotto-graph signals",
2437-2440.
Flego, Federico / Armani, Luca / Omologo, Maurizio:
"On the use of a weighted autocorrelation based fundamental frequency estimation for a multidimensional speech input",
2441-2444.
Reddy, Aarthi M. / Raj, Bhiksha:
"A minimum mean squared error estimator for single channel speaker separation",
2445-2448.
Molla, Md. Khademul Islam / Hirose, Keikichi / Minematsu, Nobuaki:
"Audio source separation from the mixture using empirical mode decomposition with independent subspace analysis",
2449-2452.
Oh, In-Jung / Chung, Hyun-Yeol / Cho, Jae-Won / Jung, Ho-Youl / Prost, R.:
"Audio watermarking in sub-band signals using multiple echo kernels",
2453-2456.
Zhang, Jie / Wu, Zhenyang:
"A piecewise interpolation method based on log-least square error criterion for HRTF",
2457-2460.
Navarro-Mesa, Juan L. / Quintana-Morales, Pedro J.:
"Modified realizable frequency warped ARMA modeling and its application in synthesis structures for voiced speech",
2461-2464.
Muralishankar, R. / Ramakrishnan, A. G. / Kaushik, Lakshmish N.:
"Time-scaling of speech using independent subspace analysis",
2465-2468.
Girin, Laurent / Firouzmand, Mohammad / Marchand, Sylvain:
"Long term modeling of phase trajectories within the speech sinusoidal model framework",
2469-2472.
Soltani, Tina / Hermann, Dave / Cornu, Etienne / Sheikhzadeh, Hamid / Brennan, Rob:
"An acoustic shock limiting algorithm using time and frequency domain speech features",
2473-2476.
Shin, Jong Won / Chang, Joon-Hyuk / Kim, Nam Soo:
"Speech probability distribution based on generalized gamma distribution",
2477-2480.
Zheng, Yanli / Hasegawa-Johnson, Mark / Borys, Sarah:
"Stop consonant classification by dynamic formant trajectory",
2481-2484.
Shiga, Yoshinori / King, Simon:
"Estimating detailed spectral envelopes using articulatory clustering",
2485-2488.
Speech Production
Engwall, Olov:
"From real-time MRI to 3D tongue movements",
1109-1112.
Nakamura, Mitsuhiro:
"Coarticulatory variability and directionality in [s,..]: an EPG study",
1113-1116.
Tanabe, Yosuke / Kaburagi, Tokihiko:
"Flow representation through the glottis having a polygonal boundary shape",
1117-1120.
Pulakka, Hannu / Alku, Paavo / Granqvist, Svante / Hertegard, Stellan / Larsson, Hans / Laukkanen, Anne-Maria / Lindestad, Per-Ake / Vilkman, Erkki:
"Analysis of the voice source in different phonation types: simultaneous high-speed imaging of the vocal fold vibration and glottal inverse filtering",
1121-1124.
Birkholz, Peter / Jackel, Dietmar:
"Influence of temporal discretization schemes on formant frequencies and bandwidths in time domain simulations of the vocal tract system",
1125-1128.
Toda, Tomoki / Black, Alan / Tokuda, Keiichi:
"Acoustic-to-articulatory inversion mapping with Gaussian mixture model",
1129-1132.
Audio-Visual Speech Processing
Kim, Jinyoung / Kim, Jeesun / Davis, Chris:
"Audio-visual spoken language processing",
1133-1136.
Sekiyama, Kaoru / Burnham, Denis:
"Issues in the development of auditory-visual speech perception: adults, infants, and children",
1137-1140.
Krahmer, Emiel / Swerts, Marc:
"Signaling and detecting uncertainty in audiovisual speech by children and adults",
1141-1144.
Hazan, Valerie / Sennema, Anke / Faulkner, Andrew:
"Effect of intensive audiovisual perceptual training on the perception and production of the /l/-/r/ contrast for Japanese learners of English",
1145-1148.
Vroomen, Jean / Linden, Sabine van / Gelder, Beatrice de / Bertelson, Paul:
"Visual recalibration of auditory speech versus selective speech adaptation: different build-up courses",
1149-1152.
Davis, Chris / Kim, Jeesun:
"Of the top of the head: audio-visual speech perception from the nose up",
1153-1156.
Millar, J. Bruce / Wagner, Michael / Goecke, Roland:
"Aspects of speaking-face data corpus design methodology",
1157-1160.
Schwartz, Jean-Luc / Cathiard, Marie:
"Modeling audio-visual speech perception: back on fusion architectures and fusion control",
2017-2020.
Sams, Mikko / Ojanen, Ville / Tuomainen, Jyrki / Klucharev, Vasily:
"Neurocognition of speech-specific audiovisual perception",
2021-2024.
Barbosa, Adriano Vilela / Vatikiotis-Bateson, Eric / Daffertshofer, Andreas:
"Target practice on talking faces",
2025-2028.
Odisio, Matthias / Bailly, Gérard:
"Audiovisual perceptual evaluation of resynthesised speech movements",
2029-2032.
Fagel, Sascha:
"Video-realistic synthetic speech with a parametric visual speech synthesizer",
2033-2036.
Scanlon, Patricia / Potamianos, Gerasimos / Libal, Vit / Chu, Stephen M.:
"Mutual information based visual feature selection for lipreading",
2037-2040.
Lee, Bowon / Hasegawa-Johnson, Mark / Goudeseune, Camille / Kamdar, Suketu / Borys, Sarah / Liu, Ming / Huang, Thomas:
"AVICAR: audio-visual speech corpus in a car environment",
2489-2492.
Erzin, Engin / Yemez, Yucel / Tekalp, A. Murat:
"Adaptive classifier cascade for multimodal speaker identification",
2493-2496.
Iba, Midori / Sennema, Anke / Hazan, Valerie / Faulkner, Andrew:
"Use of visual cues in the perception of a labial/labiodental contrast by Spanish-L1 and Japanese-L1 learners of English",
2497-2500.
Zhang, Xianxian / Takeda, Kazuya / Hansen, John H. L. / Maeno, Toshiki:
"Audio-visual speaker localization for car navigation systems",
2501-2504.
Chaloupka, Josef:
"Automatic lips reading for audio-visual speech processing and recognition",
2505-2508.
Wagner, Michael / Chetty, Girija:
""liveness" verification in audio-video authentication",
2509-2512.
Martinez, Maria José Sanchez / Gutierrez, Juan Pablo de la Cruz:
"Speech recognition using motion based lipreading",
2513-2516.
Berthommier, Frédéric:
"Comparative study of linear and non-linear models for viseme inversion: modeling of a cortical associative function",
2517-2520.
Cisar, Petr / Krnoul, Zdenek / Zelezny, Milos:
"3D lip-tracking for audio-visual speech recognition in real applications",
2521-2524.
Millar, J. Bruce / Goecke, Roland:
"The audio-video Australian English speech data corpus AVOZES",
2525-2528.
Hong, Ki-Hyung / Lee, Yong-Ju / Suh, Jae-Young / Lee, Kyong-Nim:
"Correcting Korean vowel speech recognition errors with limited lip features",
2529-2532.
Nielsen, Kuniko:
"Segmental differences in the visual contribution to speech intelligibility",
2533-2536.
Spoken Language Generation and Synthesis III
Ye, Hui / Young, Steve:
"Voice conversion for unknown speakers",
1161-1164.
Fischer, Volker / Ordinas, Jaime Botella / Kunzmann, Siegfried:
"Domain adaptation methods in the IBM trainable text-to-speech system",
1165-1168.
Zhou, Yi / Zu, Yiqing / Yu, Zhenli / Yue, Dongjian / Chen, Guilin:
"Applying pitch connection control in Mandarin speech synthesis",
1169-1172.
Ney, Hermann / Suendermann, David / Bonafonte, Antonio / Hoege, Harald:
"A first step towards text-independent voice conversion",
1173-1176.
Yu, Zhenli / Wang, Kaizhi / Zu, Yiqing / Yue, Dongjian / Chen, Guilin:
"Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems",
1177-1180.
Vepa, Jithendra / King, Simon:
"Subjective evaluation of join cost functions used in unit selection speech synthesis",
1181-1184.
Zen, Heiga / Kitamura, Tadashi / Bulut, Murtaza / Narayanan, Shrikanth / Tsuzuki, Ryosuke / Tokuda, Keiichi:
"Constructing emotional speech synthesizers with limited speech database",
1185-1188.
Lin, Cheng-Yuan / Jang, Jyh-Shing Roger:
"A two-phase pitch marking method for TD-PSOLA synthesis",
1189-1192.
Bonafonte, Antonio / Kain, Alexander / Santen, Jan van / Duxans, Helenca:
"Including dynamic and phonetic information in voice conversion systems",
1193-1196.
Wang, Zixiang / Wang, Renhua / Shuang, Zhiwei / Ling, Zhenhua:
"A novel voice conversion system based on codebook mapping with phoneme-tied weighting",
1197-1200.
Ling, Zhenhua / Hu, Yu / Shuang, Zhiwei / Wang, Renhua:
"Compression of speech database by feature separation and pattern clustering using STRAIGHT",
1201-1204.
Kataoka, Shunsuke / Mizutani, Nobuaki / Tokuda, Keiichi / Kitamura, Tadashi:
"Decision-tree backing-off in HMM-based speech synthesis",
1205-1208.
Nishizawa, Nobuyuki / Kawai, Hisashi:
"Using a depth-restricted search to reduce delays in unit selection",
1209-1212.
Yamagishi, Junichi / Masuko, Takashi / Kobayashi, Takao:
"MLLR adaptation for hidden semi-Markov model based speech synthesis",
1213-1216.
Breuer, Stefan / Abresch, Julia:
"Phoxsy: multi-phone segments for unit selection speech synthesis",
1217-1220.
Alias, Francesc / Llora, Xavier / Iriondo, Ignasi / Socoro, Joan Claudi / Sevillano, Xavier / Formiga, Lluis:
"Perception-guided and phonetic clustering weight tuning based on diphone pairs for unit selection TTS",
1221-1224.
En-Najjary, Taoufik / Rosec, Olivier / Chonavel, Thierry:
"A voice conversion method based on joint pitch and spectral envelope transformation",
1225-1228.
En-Najjary, Taoufik / Rosec, Olivier / Chonavel, Thierry:
"Fast GMM-based voice conversion for text-to-speech synthesis systems",
1229-1232.
Kumar, Rohit:
"A genetic algorithm for unit selection based speech synthesis",
1233-1236.
Huang, Jun / Olorenshaw, Lex / Hernandez-Abrego, Gustavo / Duan, Lei:
"A memory efficient grapheme-to-phoneme conversion system for speech processing",
1237-1240.
Kumar, Rohit / Kishore, S. Prahallad:
"Automatic pruning of unit selection speech databases for synthesis without loss of naturalness",
1377-1380.
Lambert, Tanya / Breen, Andrew:
"A database design for a TTS synthesis system using lexical diphones",
1381-1384.
Kominek, John / Black, Alan W:
"A family-of-models approach to HMM-based segmentation for unit selection speech synthesis",
1385-1388.
Zhang, Wei / Jin, Ling / Ma, Xijun:
"Mutual-information based segment pre-selection in concatenative text-to-speech",
1389-1392.
Zen, Heiga / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi:
"Hidden semi-Markov model based speech synthesis",
1393-1396.
Pfitzinger, Hartmut R.:
"DFW-based spectral smoothing for concatenative speech synthesis",
1397-1400.
Min, Kyung-Joong / Lim, Un-Cheon:
"Korean prosody generation and artificial neural networks",
1869-1872.
Yoon, Kyuchul:
"A prosodic phrasing model for a Korean text-to-speech synthesis system",
1873-1876.
Shi, Qin / Fischer, Volker:
"A comparison of statistical methods and features for the prediction of prosodic structures",
1877-1880.
Chen, Guilin / Han, Ke-Song:
"Letter-to-sound for small-footprint multilingual TTS engine",
1881-1884.
Xu, Jun / Fu, Guohong / Li, Haizhou:
"Grapheme-to-phoneme conversion for Chinese text-to-speech",
1885-1888.
Schröder, Marc / Breuer, Stefan:
"XML representation languages as a way of interconnecting TTS modules",
1889-1892.
Cao, Wenjie / Zong, Chengqing / Xu, Bo:
"Approach to interchange-format based Chinese generation",
1893-1896.
Zovato, Enrico / Sandri, Stefano / Quazza, Silvia / Badino, Leonardo:
"Prosodic analysis of a multi-style corpus in the perspective of emotional speech synthesis",
1897-1900.
Min, Kyung-Joong / Kang, Chan-Goo / Lim, Un-Cheon:
"Number of output nodes of artificial neural networks for Korean prosody generation",
1901-1904.
Kim, Sunhee / Ahn, Ju-Eun / Kim, Soon-Hyob / Lee, Yang-Hee:
"A Korean grapheme-to-phoneme conversion system using selection procedure for exceptions",
1905-1908.
Khaorapapong, Thanate / Karnjanadecha, Montri / Inthavisas, Keerati:
"Synthesis of vowels and tones in Thai language by articulatory modeling",
1909-1912.
Shiga, Yoshinori / King, Simon:
"Source-filter separation for articulation-to-speech synthesis",
1913-1916.
Asano, Hisako / Nakajima, Hideharu / Mizuno, Hideyuki / Oku, Masahiro:
"Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet",
1917-1920.
Clermont, Frantz / Millhouse, Thomas John:
"Inexactness and robustness in cepstral-to-formant transformation of spoken and sung vowels",
1921-1924.
Saitou, Takeshi / Tsuji, Naoya / Unoki, Masashi / Akagi, Masato:
"Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice",
1925-1928.
Pollet, Vincent / Coorman, Geert:
"Statistical corpus-based speech segmentation",
1929-1932.
Matousek, Jindrich / Romportl, Jan / Tihelka, Daniel / Tychtl, Zbynek:
"Recent improvements on ARTIC: Czech text-to-speech system",
1933-1936.
Nam, HyeonSook / Jung, Youngim / Lee, Donghun / Kwon, Hyuk-chul / Yoon, Aesun:
"Learning for transliteration of Arabic-numeral expressions using decision tree for Korean TTS",
1937-1940.
Beringer, Nicole:
"How to integrate phonetic and linguistic knowledge in a text-to-phoneme conversion task: a syllabic TPC tool for French",
1941-1944.
Hamza, Wael / Eide, Ellen / Bakis, Raimo:
"Reconciling pronunciation differences between the front-end and the back-end in the IBM speech synthesis system",
2561-2564.
Ha, Juhong / Zheng, Yu / Lee, Gary Geunbae / Seong, Yoon-Suk / Kim, Byeongchang:
"High quality text-to-pinyin conversion using two-phase unknown word prediction",
2565-2568.
Kim, Yeon-Jun / Syrdal, Ann / Conkie, Alistair:
"Pronunciation lexicon adaptation for TTS voice building",
2569-2572.
Webster, Gabriel:
"Improving letter-to-pronunciation accuracy with automatic morphologically-based stress prediction",
2573-2576.
Hamza, Wael / Eide, Ellen / Bakis, Raimo / Picheny, Michael / Pitrelli, John:
"The IBM expressive speech synthesis system",
2577-2580.
Schnell, Markus / Hoffmann, Rüdiger:
"What concept-to-speech can gain for prosody",
2581-2584.
Speech Recognition - Language Model
Kawahara, Tatsuya / Uchimoto, Kiyotaka / Isahara, Hitoshi / Shitaoka, Kazuya:
"Dependency structure analysis and sentence boundary detection in spontaneous Japanese",
1353-1356.
Jamoussi, Salma / Langlois, David / Haton, Jean-Paul / Smaili, Kamel:
"Statistical feature language model",
1357-1360.
Bigi, Brigitte / Huang, Yan / Mori, Renato De:
"Vocabulary and language model adaptation using information retrieval",
1361-1364.
Mori, Shinsuke / Takuma, Daisuke:
"Word n-gram probability estimation from a Japanese raw corpus",
1365-1368.
Chien, Jen-Tzung / Chen, Hung-Ying:
"Mining of association patterns for language modeling",
1369-1372.
Chien, Jen-Tzung / Wu, Meng-Sung / Peng, Hua-Jui:
"On latent semantic language modeling and smoothing",
1373-1376.
Goel, Vaibhava:
"Conditional maximum likelihood estimation for improving annotation performance of n-gram models incorporating stochastic finite state grammars",
2237-2240.
Schofield, Edward James:
"Fast parameter estimation for joint maximum entropy language models",
2241-2244.
Vergyri, Dimitra / Kirchhoff, Katrin / Duh, Kevin / Stolcke, Andreas:
"Morphology-based language modeling for Arabic speech recognition",
2245-2248.
Khan, A. Nayeemulla / Yegnanarayana, Bayya:
"Speech enhanced multi-span language model",
2249-2252.
Schwenk, Holger / Gauvain, Jean-Luc:
"Neural network language models for conversational speech recognition",
2253-2256.
Mrva, David / Woodland, Philip C.:
"A PLSA-based language model for conversational telephone speech",
2257-2260.
Speaker Recognition
Louradour, Jerome / André-Obrecht, Regine / Daoudi, Khalid:
"Segmentation and relevance measure for speaker verification",
1401-1404.
Chetouani, Mohamed / Gas, Bruno / Zarader, Jean-Luc / Faundez-Zanuy, Marcos:
"A new nonlinear feature extraction algorithm for speaker verification",
1405-1408.
Shriberg, Elizabeth / Ferrer, Luciana / Venkataraman, Anand / Kajarekar, Sachin:
"SVM modeling of "SNERF-grams" for speaker recognition",
1409-1412.
Ho, Purdy / Moreno, Pedro:
"SVM kernel adaptation in speaker classification and verification",
1413-1416.
Iwano, Koji / Asami, Taichi / Furui, Sadaoki:
"Noise-robust speaker verification using F0 features",
1417-1420.
Chen, Zi-He / Liao, Yuan-Fu / Juang, Yau-Tarng:
"Eigen-prosody analysis for robust speaker recognition under mismatch handset environment",
1421.
Lawson, Aaron / Huggins, Mark:
"Triphone-based confidence system for speaker identification",
1745-1748.
Yoshida, Kenichi / Takagi, Kazuyuki / Ozeki, Kazuhiko:
"Improved model training and automatic weight adjustment for multi-SNR multi-band speaker identification system",
1749-1752.
Mak, Man-Wai / Yiu, Kwok-Kwong / Cheung, Ming-Cheung / Kung, Sun-Yuan:
"A new approach to channel robust speaker verification via constrained stochastic feature transformation",
1753-1756.
Tadj, Chakib / Gargour, Christian / Badri, Nabil:
"Best speaker-based structure tree for speaker verification",
1757-1760.
Chow, David / Abdulla, Waleed:
"Robust speaker identification based on perceptual log area ratio and Gaussian mixture models",
1761-1764.
Wenndt, Stanley / Floyd, Richard:
"Channel frequency response correction for speaker recognition",
1765-1768.
Yang, Jyh-Her / Liao, Yuan-Fu:
"Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition",
1769-1772.
Padilla, Michael / Quatieri, Thomas:
"A comparison of soft and hard spectral subtraction for speaker verification",
1773-1776.
Radova, Vlasta / Padrta, Ales:
"Comparison of several speaker verification procedures based on GMM",
1777-1780.
Guan, Yong / Liu, Wenju / Qi, Hongwei / Wang, Jue:
"Improving performance of text-independent speaker identification by utilizing contextual principal curves filtering",
1781-1784.
Chien, Jen-Tzung / Ting, Chuan-Wei:
"Speaker identification using probabilistic PCA model selection",
1785-1788.
Aronowitz, Hagai / Burshtein, David / Amir, Amihood:
"Text independent speaker recognition using speaker dependent word spotting",
1789-1792.
Wang, Hsiao-Chuan / Cheng, Jyh-Min:
"A study on model-based equal error rate estimation for automatic speaker verification",
1793-1796.
Matsui, Tomoko / Tanabe, Kunio:
"Probabilistic speaker identification with dual penalized logistic regression machine",
1797-1800.
Saeta, Javier R. / Hernando, Javier:
"Model quality evaluation during enrolment for speaker verification",
1801-1804.
Franti, Pasi / Karpov, Evgeny / Kinnunen, Tomi:
"Real-time speaker identification",
1805-1808.
El-Yazeed, Mohammed Abu / Kader, Nemat Abdel / El-Henawy, Mohammed:
"Multi-codebook vector quantization algorithm for speaker identification",
1809-1812.
Cheung, Ming-Cheung / Yiu, Kwok-Kwong / Mak, Man-Wai / Kung, Sun-Yuan:
"Multi-sample fusion with constrained feature transformation for robust speaker verification",
1813-1816.
Betser, Michael / Bimbot, Frédéric / Ben, Mathieu / Gravier, Guillaume:
"Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs",
2329-2332.
Zheng, Nengheng / Ching, P. C. / Lee, Tan:
"Time-frequency analysis of vocal source signal for speaker recognition",
2333-2336.
Gangadharaiah, Rashmi / Narayanaswamy, Balakrishnan / Balakrishnan, Narayanaswamy:
"A novel method for two-speaker segmentation",
2337-2340.
Yegnanarayana, Bayya / Shahina, A. / Kesheorey, M. R.:
"Throat microphone signal for speaker recognition",
2341-2344.
Ben Zeghiba, Mohamed Faouzi / Bourlard, Hervé:
"Posteriori probabilities and likelihoods combination for speech and speaker recognition",
2345-2348.
Mihoubi, Mohamed / O'Shaughnessy, Douglas / Dumouchel, Pierre:
"The use of typical sequences for robust speaker identification",
2349-2352.
Kim, KyungHwa:
"A forensic phonetic investigation into the duration and speech rate",
2353-2356.
Sreenivas, T. V. / Badaskar, Sameer:
"Mixture Gaussian model training against impostor model parameters: an application to speaker identification",
2357-2360.
Anguita, Jan / Hernando, Javier / Abad, Alberto:
"Jacobian adaptation with improved noise reference for speaker verification",
2361-2364.
Siafarikas, Mihalis / Ganchev, Todor / Fakotakis, Nikos:
"Objective wavelet packet features for speaker verification",
2365-2368.
Chaudhari, Upendra V. / Ramaswamy, Ganesh N.:
"Policy analysis framework for conversational biometrics",
2369-2372.
Choi, Woo-Yong / Kim, Jung Gon / Kim, Hyung Soon / Pan, Sung Bum:
"A new score normalization method for speaker verification with virtual impostor model",
2373-2376.
Kim, Samuel / Eriksson, Thomas / Kang, Hong-Goo:
"On the time variability of vocal tract for speaker recognition",
2377-2380.
Desai, Veena / Murthy, Hema A.:
"Distributed speaker recognition",
2381-2384.
Angkititrakul, Pongtep / Baghaii, Sepideh / Hansen, John H. L.:
"Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification",
2385-2388.
Umeda, Yoshiyuki / Kuroiwa, Shingo / Tsuge, Satoru / Ren, Fuji:
"Distributed speaker recognition using earth mover's distance",
2389-2392.
Barlow, Michael / Khodai-Joopari, Mehrdad / Clermont, Frantz:
"A forensically-motivated tool for selecting cepstrally-consistent steady-states from non-contemporaneous vowel utterances",
2393-2396.
Alexander, Anil / Drygajlo, Andrzej:
"Scoring and direct methods for the interpretation of evidence in forensic speaker recognition",
2397-2400.
Kinnunen, Tomi / Karpov, Evgeny / Franti, Pasi:
"Efficient online cohort selection method for speaker verification",
2401-2404.
Navratil, Jiri / Ramaswamy, Ganesh N. / Zilca, Ran D.:
"Statistical model migration in speaker recognition",
2585-2588.
Khan, A. Nayeemulla / Yegnanarayana, Bayya:
"Latent semantic analysis for speaker recognition",
2589-2592.
Shao, Yang / Wang, DeLiang:
"Model-based sequential organization for cochannel speaker identification",
2593-2596.
Leung, Ka-Yee / Mak, Man-Wai / Kung, Sun-Yuan:
"Articulatory feature-based conditional pronunciation modeling for speaker verification",
2597-2600.
Park, Alex / Hazen, Timothy J.:
"A comparison of normalization and training approaches for ASR-dependent speaker identification",
2601-2604.
Tran, Dat:
"New background modeling for speaker verification",
2605-2608.
Processing of Prosody by Humans and Machines
Bailly, Gérard / Holm, Bleicke / Auberge, Veronique:
"A trainable prosodic model: learning the contours implementing communicative functions within a superpositional model of intonation",
1425-1428.
Nguyen, Dung Tien / Luong, Mai Chi / Vu, Bang Kim / Mixdorff, Hansjoerg / Ngo, Huy Hoang:
"Fujisaki model based F0 contours in Vietnamese TTS",
1429-1432.
Ashimura, Kazuyuki / Kashioka, Hideki / Campbell, Nick:
"Estimating speaking rate in spontaneous speech from z-scores of pattern durations",
1433-1436.
Masuko, Takashi / Kobayashi, Takao / Miyanaga, Keisuke:
"A style control technique for HMM-based speech synthesis",
1437-1440.
Hasegawa-Johnson, Mark / Levinson, Stephen / Zhang, Tong:
"Children's emotion recognition in an intelligent tutoring scenario",
1441-1444.
Hirose, Keikichi / Minematsu, Nobuaki:
"Use of prosodic features for speech recognition",
1445-1448.
Contemporary Issues in ASR
Peters, Jochen / Drexel, Christina:
"Transformation-based error correction for speech-to-text systems",
1449-1452.
Gutkin, Alexander / King, Simon:
"Phone classification in pseudo-Euclidean vector spaces",
1453-1456.
Chung, Grace / Wang, Chao / Seneff, Stephanie / Filisko, Ed / Tang, Min:
"Combining linguistic knowledge and acoustic information in automatic pronunciation lexicon generation",
1457-1460.
Chen, Ken / Hasegawa-Johnson, Mark:
"Modeling pronunciation variation using artificial neural networks for English spontaneous speech",
1461-1464.
Aalburg, Stefanie / Hoege, Harald:
"Foreign-accented speaker-independent speech recognition",
1465-1468.
Heracleous, Panikos / Nakajima, Yoshitaka / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone",
1469-1472.
Russell, Martin / D'Arcy, Shona / Wong, Lit Ping:
"Recognition of read and spontaneous children's speech using two new corpora",
1473-1476.
Frankel, Joe / Wester, Mirjam / King, Simon:
"Articulatory feature recognition using dynamic Bayesian networks",
1477-1480.
Bouwman, Gies / Cranen, Bert / Boves, Lou:
"Predicting word correct rate from acoustic and linguistic confusability",
1481-1484.
Ishihara, Kazushi / Hattori, Yuya / Nakatani, Tomohiro / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition",
1485-1488.
Anguita, Jan / Peillon, Stephane / Hernando, Javier / Bramoulle, Alexandre:
"Word confusability prediction in automatic speech recognition",
1489-1492.
Jou, Szu-Chen / Schultz, Tanja / Waibel, Alex:
"Adaptation for soft whisper recognition using a throat microphone",
1493-1496.
Gruhn, Rainer / Markov, Konstantin / Nakamura, Satoshi:
"A statistical lexicon for non-native speech recognition",
1497-1500.
Doss, Mathew Magimai / Ikbal, Shajith / Stephenson, Todd / Bourlard, Hervé:
"Modeling auxiliary features in tandem systems",
1501-1504.
Bosch, Louis ten / Boves, Lou:
"Survey of spontaneous speech phenomena in a multimodal dialogue system and some implications for ASR",
1505-1508.
Cincarek, Tobias / Gruhn, Rainer / Nakamura, Satoshi:
"Speech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models",
1509-1512.
Stouten, Frederik / Martens, Jean-Pierre:
"Coping with disfluencies in spontaneous speech recognition",
1513-1516.
Kwon, Soonil / Narayanan, Shrikanth:
"Speaker model quantization for unsupervised speaker indexing",
1517-1520.
Gerosa, Matteo / Giuliani, Diego:
"Investigating automatic recognition of non-native children's speech",
1521-1524.
Liu, Yang / Shriberg, Elizabeth / Stolcke, Andreas / Harper, Mary:
"Using machine learning to cope with imbalanced classes in natural speech: evidence from sentence boundary and disfluency detection",
1525-1528.
Jin, Minho / Jang, Gyucheol / Yun, Sungrack / Yoo, Chang D.:
"Hybrid utterance verification based on n-best models and model derived from Kullback-Leibler divergence",
1529-1532.
Goto, Masataka / Kitayama, Koji / Itou, Katsunobu / Kobayashi, Tetsunori:
"Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations",
1533-1536.
Lee, Kyong-Nim / Chung, Minhwa:
"Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition",
1537-1540.
Möller, Sebastian / Krebber, Jan Felix / Raake, Alexander:
"Performance of speech recognition and synthesis in packet-based networks",
1541-1544.
James, Alastair Bruce / Milner, Ben P. / Gomez, Angel Manuel:
"A comparison of packet loss compensation methods and interleaving for speech recognition in burst-like packet loss",
1545-1548.
Milner, Ben P. / James, Alastair Bruce:
"An analysis of packet loss models for distributed speech recognition",
1549-1552.
Second Language Learning and Spoken Language Processing
Minematsu, Nobuaki:
"Pronunciation assessment based upon the phonological distortions observed in language learners' utterances",
1669-1672.
Suzuki, Yasuo / Sagisaka, Yoshinori / Shirai, Katsuhiko / Muto, Makiko:
"Analysis of the phone level contributions to objective evaluation of English speech by non-natives",
1673-1676.
Wang, Chao / Peabody, Mitchell / Seneff, Stephanie / Kim, Jong-mi:
"An interactive English pronunciation dictionary for Korean learners",
1677-1680.
Rhee, Seok-Chae / Park, Jeon G.:
"Development of the knowledge-based spoken English evaluation system and its application",
1681-1684.
Bernstein, Jared / Barbier, Isabella / Rosenfeld, Elizabeth / Jong, John H.A.L. de:
"Theory and data in spoken language assessment",
1685-1688.
Kawahara, Tatsuya / Dantsuji, Masatake / Tsubota, Yasushi:
"Practical use of English pronunciation system for Japanese students in the CALL classroom",
1689-1692.
Beskow, Jonas / Engwall, Olov / Granstrom, Bjorn / Wik, Preben:
"Design strategies for a virtual language tutor",
1693-1696.
Emerging Research: Human Factors in Speech and Communication Systems
Campana, Ellen / Tanenhaus, Michael K. / Allen, James F. / Remington, Roger W.:
"Evaluating cognitive load in spoken language interfaces using a dual-task paradigm",
1721-1724.
Black, Lesley-Ann / Black, Norman / Harper, Roy / Lemon, Michelle / McTear, Michael:
"The voice-logbook: integrating human factors for a chronic care system",
1725-1728.
Jokinen, Kristiina:
"Communicative competence and adaptation in a spoken dialogue system",
1729-1732.
Fu, Zhan / Pow, Lay Ling / Chen, Fang:
"Evaluation of the difference between the driving behavior of a speech based and a speech-visual based task of an in-car computer",
1733-1736.
Möller, Sebastian / Krebber, Jan Felix / Smeele, Paula M. T.:
"Evaluating system metaphors via the speech output of a smart home system",
1737-1740.
Hammer, Florian / Reichl, Peter / Raake, Alexander:
"Elements of interactivity in telephone conversations",
1741-1744.
Interdisciplinary Topics in Spoken Language Processing
San-Segundo, Ruben / Montero, Juan Manuel / Macias-Guarasa, Javier / Córdoba, Ricardo de / Ferreiros, Javier / Pardo, José Manuel:
"Generating gestures from speech",
1817-1820.
Kanedera, Noboru / Asuka, Sumida / Ikehata, Takao / Funada, Tetsuo:
"Subtopic segmentation in the lecture speech",
1821-1824.
Erickson, Donna / Menezes, Caroline / Fujino, Akinori:
"Some articulatory measurements of real sadness",
1825-1828.
Lee, Chen-Long / Chang, Wen-Whei / Chiang, Yuan-Chuan:
"Application of voice conversion to hearing-impaired Mandarin speech enhancement",
1829-1832.
Kweon, Oh Pyo / Ito, Akinori / Suzuki, Motoyuki / Makino, Shozo:
"A Japanese dialogue-based CALL system with mispronunciation and grammar error detection",
1833-1836.
Jo, Cheolwoo / Bak, Ilsuh:
"Statistics-based direction finding for training vowels",
1837-1840.
Montanari, Simona / Yildirim, Serdar / Andersen, Elaine / Narayanan, Shrikanth:
"Reference marking in children's computer-directed speech: an integrated analysis of discourse and gestures",
1841-1844.
Kim, Jong-mi / Flynn, Suzanne:
"What makes a non-native accent?: a study of Korean English",
1845-1848.
Kim, Sang-Jin / Kim, Kwang-Ki / Hahn, Minsoo:
"Study on emotional speech features in Korean with its application to voice color conversion",
1849-1852.
Amano, Shigeaki / Nakatani, Tomohiro / Kondo, Tadahisa:
"Developmental changes in voiced-segment ratio for Japanese infants and parents",
1853-1856.
You, Kisun / Kim, Hoyoun / Sung, Wonyong:
"Implementation of an intonational quality assessment system for a handheld device",
1857-1860.
Beautemps, Denis / Burger, Thomas / Girin, Laurent:
"Characterizing and classifying cued speech vowels from labial parameters",
1861-1864.
Takahashi, Shin-ya / Morimoto, Tsuyoshi / Maeda, Sakashi / Tsuruta, Naoyuki:
"Cough detection in spoken dialogue system for home health care",
1865-1868.
Towards Adaptive Machines: Active and Unsupervised Learning
Yu, Dong / Hwang, Mei-Yuh / Mau, Peter / Acero, Alex / Deng, Li:
"Unsupervised learning from users' error correction in speech dictation",
1969-1972.
Meyer, Gerard G. L. / Kamm, Teresa M.:
"Robustness aspects of active learning for acoustic modeling",
1973-1976.
Visweswariah, Karthik / Gopinath, Ramesh / Goel, Vaibhava:
"Task adaptation of acoustic and language models based on large quantities of data",
1977-1980.
Lussier, Luc / Whittaker, Edward W.D. / Furui, Sadaoki:
"Unsupervised language model adaptation methods for spontaneous speech",
1981-1984.
Nishida, Masafumi / Mamiya, Yoshitaka / Horiuchi, Yasuo / Ichikawa, Akira:
"On-line incremental adaptation based on reinforcement learning for robust speech recognition",
1985-1988.
Watanabe, Tomohiro / Nishizaki, Hiromitsu / Utsuro, Takehito / Nakagawa, Seiichi:
"Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems",
1989-1992.
Speech Coding
Dusan, Sorin / Flanagan, James / Karve, Amod / Balaraman, Mridul:
"Speech coding using trajectory compression and multiple sensors",
1993-1996.
Feldbauer, Christian / Kubin, Gernot:
"How sparse can we make the auditory representation of speech?",
1997-2000.
Malah, David / Shectman, Slava:
"Efficient sub-optimal temporal decomposition with dynamic weighting of speech signals for coding applications",
2001-2004.
Gunawan, Teddy Surya / Ambikairajah, Eliathamby / Epps, Julien:
"Perceptual wavelet packet audio coder",
2005-2008.
Jung, Sung-Kyo / Kang, Hong-Goo / Youn, Dae-Hee / Lee, Chang-Heon:
"Performance analysis of transcoding algorithms in packet-loss environments",
2009-2012.
Falk, Tiago / Chan, Wai-Yip / Kabal, Peter:
"Speech quality estimation using Gaussian mixture models",
2013-2016.
Robust ASR
Kim, Hong Kook / Rahim, Mazin:
"Why speech recognizers make errors? a robustness view",
1645-1648.
Ahadi, Mohammad / Sheikhzadeh, Hamid / Brennan, Robert / Freeman, George:
"An energy normalization scheme for improved robustness in speech recognition",
1649-1652.
Huerta, Juan / Marcheret, Etienne / Balakrishnan, Sreeram:
"Rapid on-line environment compensation for server-based speech recognition in noisy mobile environments",
1653-1656.
Ansary, Leila / Salehi, Seyyed Ali Seyyed:
"Modeling phones coarticulation effects in a neural network based speech recognition system",
1657-1660.
Willett, Daniel:
"Error-weighted discriminative training for HMM parameter estimation",
1661-1664.
Lo, Wai Kit / Soong, Frank K. / Nakamura, Satoshi:
"Robust verification of recognized words in noise",
1665-1668.
Li, Zili / Tolba, Hesham / O'Shaughnessy, Douglas:
"Robust automatic speech recognition using an optimal spectral amplitude estimator algorithm in low-SNR car environments",
2041-2044.
Zhao, Junhui / Kuang, Jingming / Xie, Xiang:
"Robust speech recognition using data-driven temporal filters based on independent component analysis",
2045-2048.
Kitaoka, Norihide / Wang, Longbiao / Nakagawa, Seiichi:
"Robust distant speech recognition based on position dependent CMN",
2049-2052.
Sakauchi, Sumitaka / Yamaguchi, Yoshikazu / Takahashi, Satoshi / Kobashikawa, Satoshi:
"Robust speech recognition based on HMM composition and modified Wiener filter",
2053-2056.
Brito, Ivan / Yoma, Nestor Becerra / Molina, Carlos:
"Feature-dependent compensation in speech recognition",
2057-2060.
Cox, Stephen:
"Using context to correct phone recognition errors",
2061-2064.
Obuchi, Yasunari:
"Improved histogram-based feature compensation for robust speech recognition and unsupervised speaker adaptation",
2065-2068.
Xiong, Zhenyu / Zheng, Fang / Wu, Wenhu:
"Weighting observation vectors for robust speech recognition in noisy environments",
2069-2072.
Tsujikawa, Masanori / Iso, Ken-ichi:
"Hands-free speech recognition using blind source separation post-processed by two-stage spectral subtraction",
2073-2076.
Gomez, Randy / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Robust speech recognition with spectral subtraction in low SNR",
2077-2080.
Cranen, Bert / Veth, Johan de:
"Active perception: using a priori knowledge from clean speech models to ignore non-target features",
2081-2084.
Xu, Haitian / Tan, Zheng-Hua / Dalsgaard, Paul / Lindberg, Borge:
"Spectral subtraction with full-wave rectification and likelihood controlled instantaneous noise estimation for robust speech recognition",
2085-2088.
Korkmazsky, Filipp / Fohr, Dominique / Illina, Irina:
"Using linear interpolation to improve histogram equalization for speech recognition",
2089-2092.
Hasegawa-Johnson, Mark / Deoras, Ameya:
"A factorial HMM approach to robust isolated digit recognition in background music",
2093-2096.
Lee, Yoonjae / Ko, Hanseok:
"Multi-eigenspace normalization for robust speech recognition in noisy environments",
2097-2100.
Cerisara, Christophe / Fohr, Dominique / Mella, Odile / Illina, Irina:
"Exploiting models intrinsic robustness for noisy speech recognition",
2101-2104.
Pujol, Pere / Padrell, Jaume / Nadeu, Climent / Macho, Dusan:
"Speech recognition experiments with the SPEECON database using several robust front-ends",
2105-2108.
Ikbal, Shajith / Doss, Mathew Magimai / Misra, Hemant / Bourlard, Hervé:
"Spectro-temporal activity pattern (STAP) features for noise robust ASR",
2109-2112.
Kim, Byoung-Don / Kim, Jin-Young / Choi, Seung-Ho / Lee, Young-Bum / Lee, Kyoung-Rok:
"Improvement of confidence measure performance using background model set algorithm",
2113-2116.
Aradilla, Guillermo / Dines, John / Sivadas, Sunil:
"Using RASTA in task independent TANDEM feature extraction",
2117-2120.
Han, Kyu Jeong / Narayanan, Shrikanth / Srinivasamurthy, Naveen:
"A distributed speech recognition system in multi-user environments",
2121-2124.
Haeb-Umbach, Reinhold / Ion, Valentin:
"Soft features for improved distributed speech recognition over wireless networks",
2125-2128.
Emerging Research
Ebukuro, Rinzou:
"Analysis on disappearing and thriving of speech applications for ergonomic design guidelines and recommendations",
2217-2220.
Smeele, Paula M. T. / Möller, Sebastian / Krebber, Jan Felix:
"Evaluation of the speech output of a smart-home system in a car environment",
2221-2224.
Haas, Ellen:
"How does the integration of speech recognition controls and spatialized auditory displays affect user workload?",
2225-2228.
Chen, Fang:
"Speech interaction system - how to increase its usability?",
2229-2232.
Beringer, Nicole:
"Human language acquisition methods in a machine learning task",
2233-2236.
Spoken Language Resources and Technology Evaluation I
Dybkjaer, Laila / Bernsen, Niels Ole / Minker, Wolfgang:
"New challenges in usability evaluation - beyond task-oriented spoken dialogue systems",
2261-2264.
Kimball, Owen / Kao, Chia-lin / Iyer, Rukmini / Arvizo, Teodoro / Makhoul, John:
"Using quick transcriptions to improve conversational speech models",
2265-2268.
Mishra, Rohit / Shriberg, Elizabeth / Upson, Sandra / Chen, Joyce / Weng, Fuliang / Peters, Stanley / Cavedon, Lawrence / Niekrasz, John / Cheng, Hua / Bratt, Harry:
"A Wizard of Oz framework for collecting spoken human-computer dialogs",
2269-2272.
Hartikainen, Mikko / Salonen, Esa-Pekka / Turunen, Markku:
"Subjective evaluation of spoken dialogue systems using SERVQUAL method",
2273-2276.
Vasilescu, Ioana / Devillers, Laurence / Clavel, Chloe / Ehrette, Thibaut:
"Fiction database for emotion detection in abnormal situations",
2277-2280.
Sarikaya, Ruhi / Gao, Yuqing / Virga, Paola:
"Fast semi-automatic semantic annotation for spoken dialog systems",
2281-2284.
Wu, Yi-Jian / Kawai, Hisashi / Ni, Jinfu / Wang, Renhua:
"A study on automatic detection of Japanese vowel devoicing for speech synthesis",
2721-2724.
Ciloglu, Tolga / Acar, Dinc / Tokatli, Ahmet:
"Orientel-Turkish: telephone speech database description and notes on the experience",
2725-2728.
Yoon, Tae-Jin / Chavarria, Sandra / Cole, Jennifer / Hasegawa-Johnson, Mark:
"Intertranscriber reliability of prosodic labeling on telephone conversation using ToBI",
2729-2732.
Tian, Jilei:
"Efficient compression method for pronunciation dictionaries",
2733-2736.
Liang, Min-siong / Lyu, Dau-cheng / Chiang, Yuang-chin / Lyu, Renyuan:
"Construct a multi-lingual speech corpus in Taiwan with extracting phonetically balanced articles",
2737-2740.
Heggtveit, Per Olav / Natvig, Jon Emil:
"Automatic prosody labeling of read Norwegian",
2741-2744.
Sanders, Eric / Diersen, Andrea / Jongenburger, Willy / Strik, Helmer:
"Towards automatic word segmentation of dialect speech",
2745-2748.
Fousek, Petr / Grezl, Frantisek / Hermansky, Hynek / Svojanovsky, Petr:
"New nonsense syllables database - analyses and preliminary ASR experiments",
2749-2752.
Krebber, Jan Felix / Möller, Sebastian / Raake, Alexander:
"Speech input and output module assessment for remote access to a smart-home spoken dialog system",
2753-2756.
Kim, Dong-Hyun / Roh, Yong-Wan / Hong, Kwang-Seok:
"An implement of speech DB gathering system using VoiceXML",
2757-2760.
Almasganj, Farshad:
"Precise phone boundary detection using wavelet packet and recurrent neural networks",
2761-2764.
Morris, Andrew Cameron / Maier, Viktoria / Green, Phil:
"From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition",
2765-2768.
Rhee, Seok-Chae / Lee, Sook-Hyang / Lee, Young-Ju / Kang, Seok-Keun:
"Design and construction of Korean-spoken English corpus",
2769-2772.
Vriend, Folkert De / Maltese, Giulio:
"Exploring XML-based technologies and procedures for quality evaluation from a real-life case perspective",
2773-2776.
Wang, Kuansan:
"Spoken language interface in ECMA/ISO telecommunication standards",
2777-2780.
Davel, Marelie / Barnard, Etienne:
"The efficient generation of pronunciation dictionaries: machine learning factors during bootstrapping",
2781-2784.
Geumann, Anja:
"Towards a new level of annotation detail of multilingual speech corpora",
2785-2788.
Kawaguchi, Nobuo / Matsubara, Shigeki / Yamaguchi, Yukiko / Takeda, Kazuya / Itakura, Fumitada:
"CIAIR in-car speech database",
2789-2792.
Bael, Christophe Van / Heuvel, Henk van den / Strik, Helmer:
"Investigating speech style specific pronunciation variation in large spoken language corpora",
2793-2796.
Davel, Marelie / Barnard, Etienne:
"The efficient generation of pronunciation dictionaries: human factors during bootstrapping",
2797-2800.
Multi-Modal / Multi-Media Processing
Moore, Roger K.:
"Modeling data entry rates for ASR and alternative input methods",
2285-2288.
Ban, Hiromitsu / Miyajima, Chiyomi / Itou, Katsunobu / Itakura, Fumitada / Takeda, Kazuya:
"Speech recognition using synchronization between speech and finger tapping",
2289-2292.
Gupta, Anurag Kumar / Anastasakos, Tasos:
"Integration patterns during multimodal interaction",
2293-2296.
Marcheret, Etienne / Chu, Stephen M. / Goel, Vaibhava / Potamianos, Gerasimos:
"Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition",
2297-2300.
Choi, Changkyu / Kong, Donggeon / Lee, Hyoung-Ki / Yoon, Sang Min:
"Separation of multiple concurrent speeches using audio-visual speaker localization and minimum variance beam-forming",
2301-2304.
Ariyoshi, Tokitomo / Nakadai, Kazuhiro / Tsujino, Hiroshi:
"Multimodal expression for humanoid robots by integration of human speech mimicking and facial color",
2305-2308.
Automatic Speech Recognition in the Context of Mobile Communications
Novak, Miroslav:
"Towards large vocabulary ASR on embedded platforms",
2309-2312.
Fujimura, Hiroshi / Itou, Katsunobu / Takeda, Kazuya / Itakura, Fumitada:
"Analysis of in-car speech recognition experiments using a large-scale multi-mode dialogue corpus",
2313-2316.
Tan, Zheng-Hua / Dalsgaard, Paul / Lindberg, Borge:
"On the integration of speech recognition into personal networks",
2317-2320.
Rose, Richard / Kim, Hong Kook:
"Robust speech recognition in client-server scenarios",
2321-2324.
Jeong, Sangbae / Han, Iksang / Jon, Eugene / Kim, Jeongsu:
"Memory and computation reduction for embedded ASR systems",
2325-2328.
Robust Features for ASR
Fukuda, Takashi / Nitta, Tsuneo:
"Canonicalization of feature parameters for automatic speech recognition",
2537-2540.
Srinivasan, Soundararajan / Roman, Nicoleta / Wang, DeLiang:
"On binary and ratio time-frequency masks for robust speech recognition",
2541-2544.
Sanchis, Alberto / Juan, Alfons / Vidal, Enrique:
"New features based on multiple word graphs for utterance verification",
2545-2548.
Burget, Lukas:
"Combination of speech features using smoothed heteroscedastic linear discriminant analysis",
2549-2552.
Ikbal, Shajith / Misra, Hemant / Sivadas, Sunil / Hermansky, Hynek / Bourlard, Hervé:
"Entropy based combination of tandem representations for noise robust ASR",
2553-2556.
Yook, Dongsuk / Kim, Donghyun:
"Fast speech adaptation in linear spectral domain for additive and convolutional noise",
2557-2560.
Towards Rapid Speech and Natural Language Application Development: Tooling, Architectures, Components and Standards
Hetherington, Lee:
"The MIT finite-state transducer toolkit for speech and language processing",
2609-2612.
Feng, Junlan / Bangalore, Srinivas / Rahim, Mazin:
"Question-answering in webtalk: an evaluation study",
2613-2616.
Huerta, Juan / Ekanadham, Chaitanya:
"Automatic network optimization of voice applications",
2617-2620.
Rodriguez-Moreno, Miguel Angel / Cuayahuitl, Heriberto / Montiel-Hernandez, Juventino:
"Voicebuilder: a framework for automatic speech application development",
2621-2624.
Facco, Andrea / Falavigna, Daniele / Gretter, Roberto / Vigano, Marcello:
"On the development of telephone applications: some practical issues and evaluation",
2625-2628.
Hamerich, Stefan / Schless, Volker / Kladis, Basilis / Schubert, Volker / Kocsis, Otilia / Igel, Stefan / Córdoba, Ricardo de / D'Haro, Luis Fernando / Pardo, José Manuel:
"The GEMINI platform: semi-automatic generation of dialogue applications",
2629-2632.
Speech Coding and Enhancement
Kondo, Kazuhiro / Nakagawa, Kiyoshi:
"A packet loss concealment method using recursive linear prediction",
2633-2636.
Lee, Minkyu / Zitouni, Imed / Zhou, Qiru:
"On a n-gram model approach for packet loss concealment",
2637-2640.
So, Stephen / Paliwal, Kuldip K.:
"Efficient vector quantisation of line spectral frequencies using the switched split vector quantiser",
2641-2644.
Chaitanya, M. / Prasanna, S. R. M. / Yegnanarayana, Bayya:
"Enhancement of reverberant speech using excitation source information",
2645-2648.
Kinoshita, Keisuke / Nakatani, Tomohiro / Miyoshi, Masato:
"Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation",
2649-2652.
Lee, Seung Yeol / Kim, Nam Soo / Chang, Joon-Hyuk:
"Inner product based-multiband vector quantization for wideband speech coding at 16 kbps",
2653-2656.
Abad, Alberto / Hernando, Javier:
"Speech enhancement and recognition by integrating adaptive beamforming and Wiener filtering",
2657-2660.
Kim, Kyung-Tae / Jung, Sung-Kyo / Lee, MiSuk / Kang, Hong-Goo / Youn, Dae Hee:
"Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders",
2661-2664.
Asai, Tatsunori / Miyabe, Shigeki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Interface for barge-in free spoken dialogue system using adaptive sound field control",
2665-2668.
Kim, Jong-Hark / Shin, Jae-Hyun / Lee, In-Sung:
"Multi-mode harmonic transform excitation LPC coding for speech and music",
2669-2672.
Gandhi, Mital / Hasegawa-Johnson, Mark:
"Source separation using particle filters",
2673-2676.
Ramo, Anssi / Nurminen, Jani / Himanen, Sakari / Heikkinen, Ari:
"Segmental speech coding model for storage applications",
2677-2680.
Ju, Gwo-hwa / Lee, Lin-shan:
"Improved speech enhancement by applying time-shift property of DFT on Hankel matrices for signal subspace decomposition",
2681-2684.
Turunen, Jari Juhani / Tanttu, Juha / Cameron, Frank:
"Minimum phase compensation in speech coding using Hammerstein model",
2685-2688.
Li, Weifeng / Itakura, Fumitada / Takeda, Kazuya:
"Optimizing regression for in-car speech recognition using multiple distributed microphones",
2689-2692.
Li, Weifeng / Takeda, Kazuya / Itakura, Fumitada / Tran, Huy Dat:
"Speech enhancement based on magnitude estimation using the gamma prior",
2693-2696.
Errity, Andrew / McKenna, John / Isard, Stephen:
"Unscented Kalman filtering of line spectral frequencies",
2697-2700.
Kim, Hyoung-Gook / Sikora, Thomas:
"Speech enhancement based on smoothing of spectral noise floor",
2701-2704.
Li, Junfeng / Akagi, Masato:
"Noise reduction using hybrid noise estimation technique and post-filtering",
2705-2708.
Gabrea, Marcel:
"An adaptive Kalman filter for the enhancement of speech signals",
2709-2712.
Sreenivas, T. V. / Sharath Rao, K. / Sreenivasa Murthy, A.:
"Improved iterative Wiener filtering for non-stationary noise speech enhancement",
2713-2716.
Qian, Yasheng / Kabal, Peter:
"Highband spectrum envelope estimation of telephone speech using hard/soft-classification",
2717-2720.
Acoustic Modeling for Robust ASR
Korkmazsky, Filipp / Deviren, Murat / Fohr, Dominique / Illina, Irina:
"Hidden factor dynamic Bayesian networks for speech recognition",
2801-2804.
Mao, Mark / Vanhoucke, Vincent:
"Design of compact acoustic models through clustering of tied-covariance Gaussians",
2805-2808.
Raut, Chandra Kant / Nishimoto, Takuya / Sagayama, Shigeki:
"Model composition by Lagrange polynomial approximation for robust speech recognition in noisy environment",
2809-2812.
Wu, Jian / Zhu, Donglai / Huo, Qiang:
"A study of minimum classification error training for segmental switching linear Gaussian hidden Markov models",
2813-2816.
Matsuda, Shigeki / Jitsuhiro, Takatoshi / Markov, Konstantin / Nakamura, Satoshi:
"Speech recognition system robust to noise and speaking styles",
2817-2820.
Yoma, Nestor Becerra / Brito, Ivan / Molina, Carlos:
"The stochastic weighted Viterbi algorithm: a framework to compensate additive noise and low-bit rate coding distortion",
2821-2824.
Spoken Dialogue Technology and Systems
Tomko, Stefanie / Rosenfeld, Roni:
"Shaping spoken input in user-initiative systems",
2825-2828.
Pavlovski, Christopher / Lai, Jennifer / Mitchell, Stella:
"Etiology of user experience with natural language speech",
2829-2832.
Rayner, Manny / Hockey, Beth Ann:
"Side effect free dialogue management in a voice enabled procedure browser",
2833-2836.
Lane, Ian Richard / Kawahara, Tatsuya / Ueno, Shinichi:
"Example-based training of dialogue planning incorporating user and situation models",
2837-2840.
Fujie, Shinya / Kobayashi, Tetsunori / Yagi, Daizo / Kikuchi, Hideaki:
"Prosody based attitude recognition with feature selection and its application to spoken dialog system as para-linguistic information",
2841-2844.
Ollason, David / Ju, Yun-Cheng / Bhatia, Siddharth / Herron, Dan / Liu, Jackie:
"MS connect: a fully featured auto-attendant: system design, implementation and performance",
2845-2848.
Multi-Channel Speech Processing
Haeb-Umbach, Reinhold / Peschke, Sven / Warsitz, Ernst:
"Adaptive beamforming combined with particle filtering for acoustic source localization",
2849-2852.
Kwon, Hongseok / Kim, Siho / Bae, Keunsung:
"Time delay estimation using weighted CPSP function",
2853-2856.
Potamitis, Ilyas / Zervas, Panos / Fakotakis, Nikos:
"DOA estimation of speech signals using semi-blind source separation techniques",
2857-2860.
Kim, SangGyun / Yoo, Chang D.:
"Blind separation of speech and sub-Gaussian signals in underdetermined case",
2861-2864.
Jang, Gil-Jin / Choi, Changkyu / Lee, Yong-Beom / Oh, Yung-Hwan:
"Adaptive cross-channel interference cancellation on blind signal separation outputs using source absence/presence detection and spectral subtraction",
2865-2868.
Visser, Erik / Chan, Kwokleung / Kim, Stanley / Lee, Te-Won:
"A comparison of simultaneous 3-channel blind source separation to selective separation on channel pairs using 2-channel BSS",
2869-2872.
Intersection of Spoken Language Processing and Written Language Processing
Lee, Hyun-Bok:
"Towards a harmonious coexistence of spoken and written language",
2873-2876.
Sugito, Miyoko:
"Towards a grammar of spoken language - prosody of ill-formed utterances and listener's understanding in discourse -",
2877-2880.
Kawahara, Tatsuya / Shitaoka, Kazuya / Nanjo, Hiroaki:
"Automatic transformation of lecture transcription into document style using statistical framework",
2881-2884.
Arora, Karunesh / Arora, Sunita / Verma, Kapil / Agrawal, Shyam Sunder:
"Automatic extraction of phonetically rich sentences from large text corpus of Indian languages",
2885-2888.
Calzolari, Nicoletta:
"European initiatives to promote cooperation between speech and text communities",
2889-2892.
Prosodic Recognition and Analysis
Takamaru, Keiichi:
"Evaluation of a threshold for detecting local slower phrases in Japanese spontaneous conversational speech",
2969-2972.
Effendy, Nazrul / Maneenoi, Ekkarit / Charnvivit, Patavee / Jitapunkul, Somchai:
"Intonation recognition for Indonesian speech based on Fujisaki model",
2973-2976.
Zhang, Jin-Song / Nakamura, Satoshi / Hirose, Keikichi:
"Efficient tone classification of speaker independent continuous Chinese speech using anchoring based discriminating features",
2977-2980.
Watanabe, Michiko / Den, Yasuharu / Hirose, Keikichi / Minematsu, Nobuaki:
"Clause types and filled pauses in Japanese spontaneous monologues",
2981-2984.
Yabuta, Yohei / Katagiri, Yasuhiro / Suzuki, Noriko / Takeuchi, Yugo:
"Effect of voice prosody on the decision making process in human-computer interaction",
2985-2988.
Suzuki, Noriko / Katagiri, Yasuhiro:
"Alignment of human prosodic patterns for spoken dialogue systems",
2989-2992.
Kiriyama, Shinya / Kitazawa, Shigeyoshi:
"Evaluation of a prosodic labeling system utilizing linguistic information",
2993-2996.
Blodgett, Allison:
"Functions of intonation boundaries during spoken language comprehension in English",
2997-3000.
Kühne, Marco / Wolff, Matthias / Eichner, Matthias / Hoffmann, Rüdiger:
"Voice activation using prosodic features",
3001-3004.
Kim, Sahyang:
"The role of prosodic cues in word segmentation of Korean",
3005-3008.
Jun, Sun-Ah:
"Default phrasing and attachment preference in Korean",
3009-3012.
Borys, Sarah / Cohen, Aaron / Hasegawa-Johnson, Mark / Cole, Jennifer:
"Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models",
3013-3016.
Kong, Eunjong:
"The role of pitch range variation in the discourse structure and intonation structure of Korean",
3017-3020.
Takagi, Kazuyuki / Ozeki, Kazuhiko:
"Dependency analysis of read Japanese sentences using pause and F0 information: a speaker independent case",
3021-3024.
Speer, Shari / Kang, Soyoung:
"Effects of prosodic boundaries on ambiguous syntactic clause boundaries in Japanese",
3025-3028.
Nagasaki, Yasuko / Komatsu, Takanori:
"The superior effectiveness of the F0 range for identifying the context from sounds without phonemes",
3029-3032.
Li, Tan / Karnjanadecha, Montri / Khaorapapong, Thanate:
"A study of tone classification for continuous Thai speech recognition",
3033-3036.
Kim, Key-Seop / Lim, Un / Shin, Dong-Il:
"An acoustic-analytic role for the deviation between the scansion and reading of poems",
3037-3040.
Ohsuga, Tomoko / Nishida, Masafumi / Horiuchi, Yasuo / Ichikawa, Akira:
"Estimating syntactic structure from prosodic features in Japanese speech",
3041-3044.
Komatsu, Masahiko / Sugawara, Tsutomu / Arai, Takayuki:
"Perceptual discrimination of prosodic types and their preliminary acoustic analysis",
3045-3048.
Towards Rapid Speech and Natural Language Application Development
L'Hour, Johann / Boeffard, Olivier / Siroux, Jacques / Miclet, Laurent / Charpentier, Francis / Moudenc, Thierry:
"DORIS, a multiagent/IP platform for multimodal dialogue applications",
3049-3052.
Chen, Yu:
"EVITA-RAD: an extensible enterprise voice porTAl - rapid application development tool",
3053-3056.
D'Haro, Luis F. / Córdoba, Ricardo de / San-Segundo, Ruben / Montero, Juan Manuel / Macias-Guarasa, Javier / Pardo, José Manuel:
"Strategies to reduce design time in multimodal/multilingual dialog applications",
3057-3060.
Aist, Gregory:
"Three-way system-user-expert interactions help you expand the capabilities of an existing spoken dialogue system",
3061-3064.
Fabbrizio, Giuseppe Di / Lewis, Charles:
"Florence: a dialogue manager framework for spoken dialogue systems",
3065-3068.
Kawahara, Tatsuya / Lee, Akinobu / Takeda, Kazuya / Itou, Katsunobu / Shikano, Kiyohiro:
"Recent progress of open-source LVCSR engine Julius and Japanese model repository",
3069-3072.
Murao, Hiroya / Kawaguchi, Nobuo / Matsubara, Shigeki / Yamaguchi, Yukiko / Takeda, Kazuya / Inagaki, Yasuyoshi:
"Example-based spoken dialogue system with online example augmentation",
3073-3076.
Bühler, Dirk:
"Enhancing existing form-based dialogue managers with reasoning capabilities",
3077-3080.
Turunen, Markku / Salonen, Esa-Pekka / Hartikainen, Mikko / Hakulinen, Jaakko:
"Robust and adaptive architecture for multilingual spoken dialogue systems",
3081-3084.
Filipe, Porfirio / Mamede, Nuno:
"Towards ubiquitous task management",
3085-3088.