The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages

29-31 August 2018, Gurugram, India

Chair: Shyam S. Agrawal

DOI: 10.21437/SLTU.2018

Keynote: Pushpak Bhattacharya


Machine Translation of Low Resource Related Languages
Pushpak Bhattacharya


Keynote: Emmanuel Dupoux


Zero Resource Speech Technology: Past, Present, and Future
Emmanuel Dupoux


Zero and Low Resources Scenario


Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load
Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura

Low-resource Tibetan Dialect Acoustic Modeling Based on Transfer Learning
Jinghao Yan, Zhiqiang Lv, Shen Huang, Hongzhi Yu

Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages
Brij Mohan Lal Srivastava, Sunayana Sitaram, Rupesh Kumar Mehta, Krishna Doss Mohan, Pallavi Matani, Sandeepkumar Satpal, Kalika Bali, Radhakrishnan Srikanth, Niranjan Nayak

Advances in Low Resource ASR: A Deep Learning Perspective
Hardik Sailor, Ankur Patil, Hemant Patil

Automatic Speech Recognition for Humanitarian Applications in Somali
Raghav Menon, Astik Biswas, Armin Saeb, John Quinn, Thomas Niesler

Signal Processing Cues to Improve Automatic Speech Recognition for Low Resource Indian Languages
Arun Baby, Karthik Pandia D S, Hema A Murthy


Data Collection and Crowd Sourcing


Diarization in Maximally Ecological Recordings: Data from Tsimane Children
Julien Karadayi, Camila Scaff, Alejandrina Cristia

A Small Griko-Italian Speech Translation Corpus
Marcely Zanon Boito, Antonios Anastasopoulos, Aline Villavicencio, Laurent Besacier, Marika Lekakou

Corpus Construction and Semantic Analysis of Indonesian Image Description
Khumaisa Nur'Aini, Johanes Effendi, Sakriani Sakti, Mirna Adriani, Satoshi Nakamura

Designing an IVR Based Framework for Telephony Speech Data Collection and Transcription in Under-Resourced Languages
Joyanta Basu, Soma Khan, Milton Samirakshma Bepari, Rajib Roy, Madhab Pal, Sushmita Nandi

Crowd-Sourced Speech Corpora for Javanese, Sundanese, Sinhala, Nepali, and Bangladeshi Bengali
Oddur Kjartansson, Supheakmungkol Sarin, Knot Pipatsrisawat, Martin Jansche, Linne Ha

IIITH-ILSC Speech Database for Indian Language Identification
Ravi Kumar Vuddagiri, Krishna Gurugubelli, Priyam Jain, Hari Krishna Vydana, Anil Kumar Vuppala


Poster Session I


Mining Training Data for Language Modeling Across the World's Languages
Manasa Prasad, Theresa Breiner, Daan van Esch

A Step-by-Step Process for Building TTS Voices Using Open Source Data and Frameworks for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese
Keshan Sodimana, Pasindu De Silva, Supheakmungkol Sarin, Oddur Kjartansson, Martin Jansche, Knot Pipatsrisawat, Linne Ha

Prosodic Analysis of Non-Native South Indian English Speech
Radha Krishna Guntur, R Krishnan, V.K. Mittal

Implementation of Concatenation Technique for Low Resource Text-To-Speech System Based on Marathi Talking Calculator
Monica Mundada, Sangramsing Kayte, Pradip Das

A Unified Phonological Representation of South Asian Languages for Multilingual Text-to-Speech
Isin Demirsahin, Martin Jansche, Alexander Gutkin

Acoustic Characteristics of Schwa Vowel in Punjabi
Swaran Lata, Prashant Verma, Simerjeet Kaur

A Comparative Study of SMT and NMT: Case Study of English-Nepali Language Pair
Praveen Acharya, Bal Krishna Bal

Post-Processing Using Speech Enhancement Techniques for Unit Selection and Hidden Markov Model Based Low Resource Language Marathi Text-to-Speech System
Sangramsing Kayte, Monica Mundada

Relative Phase Shift Features for Replay Spoof Detection System
Srinivas Kantheti, Hemant Patil

Empirical Study of Speech Synthesis Markup Language and Its Implementation for Punjabi Language
Atul Kumar, Shyam Agrawal

Development of IIITH Hindi-English Code Mixed Speech Database
Banothu Rambabu, Suryakanth V Gangashetty

Sinhala G2P Conversion for Speech Processing
Thilini Nadungodage, Chamila Liyanage, Amathri Perera, Randil Pushpananda, Ruvan Weerasinghe


Code Switching and Speech Detection


Automatic Detection of Palatalized Consonants in Kashmiri
Ramakrishna Thirumuru, Krishna Gurugubelli, Anil Kumar Vuppala

Improving ASR for Code-Switched Speech in Under-Resourced Languages Using Out-of-Domain Data
Astik Biswas, Ewald van der Westhuizen, Thomas Niesler, Febe de Wet

Code-Switching Detection with Data-Augmented Acoustic and Language Models
Emre Yilmaz, Henk Van Den Heuvel

SVM Based Language Diarization for Code-Switched Bilingual Indian Speech Using Bottleneck Features
Spoorthy V, Veena Thenkanidiyoor, Dileep A.D

Dialect Identification Using Tonal and Spectral Features in Two Dialects of Ao
Moakala Tzudir, Priyankoo Sarmah, S R Mahadeva Prasanna


Speech Synthesis


DNN Based Myanmar Speech Synthesis
Aye Mya Hlaing, Win Pa Pa, Ye Kyaw Thu

Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems
Keshan Sodimana, Pasindu De Silva, Richard Sproat, Theeraphol Wattanavekin, Alexander Gutkin, Knot Pipatsrisawat

Building a Natural Sounding Text-to-Speech System for the Nepali Language - Research and Development Challenges and Solutions
Roop Bajracharya, Santosh Regmi, Bal Krishna Bal, Balaram Prasain

A Human Quality Text to Speech System for Sinhala
Lakshika Nanayakkara, Chamila Liyanage, Pubudu Tharaka Viswakula, Thilini Nadungodage, Randil Pushpananda, Ruvan Weerasinghe


Automatic Speech Recognition and Language Identification


Neural Networks-based Automatic Speech Recognition for Agricultural Commodity in Gujarati Language
Hardik Sailor, Hemant Patil

Building an ASR System for Mboshi Using A Cross-Language Definition of Acoustic Units Approach
Odette Scharenborg, Patrick Ebel, Mark Hasegawa-Johnson, Najim Dehak

Incorporating Speaker Normalizing Capabilities to an End-to-End Speech Recognition System
Hari Krishna, Sivanand Achanta, Anil Kumar Vuppala

Language Identification of Assamese, Bengali and English Speech
Joyshree Chakraborty, Shikhamoni Nath, Nirmala S R, Samudravijaya K

ASR-Free CNN-DTW Keyword Spotting Using Multilingual Bottleneck Features for Almost Zero-Resource Languages
Raghav Menon, Herman Kamper, Emre Yilmaz, John Quinn, Thomas Niesler

Improving ASR Output for Endangered Language Documentation
Robbie Jimerson, Kruthika Simha, Raymond Ptucha, Emily Prudhommeaux


Poster Session II


Assessing Performance of Bengali Speech Recognizers Under Real World Conditions using GMM-HMM and DNN based Methods
Soma Khan, Madhab Pal, Joyanta Basu, Milton Samirakshma Bepari, Rajib Roy

Evaluating Code-Switched Malay-English Speech Using Time Delay Neural Networks
Anand Singh, Tien-Ping Tan

Hindi Speech Vowel Recognition Using Hidden Markov Model
Shobha Bhatt, Amita Dev, Anurag Jain

Building Speech Recognition Systems for Language Documentation: The CoEDL Endangered Language Pipeline and Inference System (ELPIS)
Ben Foley, Josh Arnold, Rolando Coto-Solano, Gautier Durantin, T. Mark Ellison, Daan van Esch, Scott Heath, František Kratochvíl, Zara Maxwell-Smith, David Nash, Ola Olsson, Mark Richards, Nay San, Hywel Stoakes, Nick Thieberger, Janet Wiles

Improved Language Identification Using Stacked SDC Features and Residual Neural Network
Ravi Kumar Vuddagiri, Hari Krishna Vydana, Anil Kumar Vuppala

Investigating the Use of Mixed-Units Based Modeling for Improving Uyghur Speech Recognition
Pengfei Hu, Shen Huang, Zhiqiang Lv

Development of Assamese Continuous Speech Recognition System
Barsha Deka, Nirmala S.R., Samudravijaya K.

Segmental and Supra Segmental Feature Based Speech Recognition System for Under Resourced Languages
Tanmay Bhowmik, Shyamal Kumar Das Mandal

Application of Egyptian Vulture Optimization in Speech Emotion Recognition
Shreya Sahu, Arpan Jain, Ritu Tiwari, Anupam Shukla

Marathi Speech Recognition
Supriya Paulose, Shikhamoni Nath, Samudravijaya K

Building an Automatic Speech Recognition System in Sora Language Using Data Collected for Acoustic Phonetic Studies
Kishalay Chakraborty, Luke Horo, Priyankoo Sarmah

JAMLIT: A Corpus of Jamaican Standard English for Automatic Speech Recognition of Children’s Speech
Stefan Watson, Andre Coy