INTERSPEECH 2011
12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Analysis of Dialectal Influence in Pan-Arabic ASR

Udhyakumar Nallasamy (1), Michael Garbus (1), Florian Metze (1), Qin Jin (1), Thomas Schaaf (2), Tanja Schultz (1)

(1) Carnegie Mellon University, USA
(2) M*Modal, USA

In this paper, we analyze the impact of five Arabic dialects on the front-end and pronunciation dictionary components of an Automatic Speech Recognition (ASR) system. We use the ASR system's phonetic decision tree as a diagnostic tool to compare the robustness of MFCC and MLP front-ends to dialectal variations in the speech data, and find that MLP Bottle-Neck features are less robust to such variations. We also perform a rule-based analysis of the pronunciation dictionary, which enables us to identify dialectal words in the vocabulary and to automatically generate pronunciations for unseen words. We show that our technique produces pronunciations with an average phone error rate of 9.2%.
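For readers unfamiliar with the metric, phone error rate is commonly computed as the edit distance between a generated and a reference phone sequence, divided by the reference length. The sketch below illustrates that common definition only. The abstract does not state how the reported figure was scored or averaged, so the pooling over all words, the function names, and the phone symbols are all assumptions made for illustration.

```python
from typing import Iterable, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phone sequences (unit costs)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1       # phone in ref missing from hyp
            insertion = curr[j - 1] + 1  # extra phone in hyp
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def phone_error_rate(pairs: Iterable[Tuple[Sequence[str], Sequence[str]]]) -> float:
    """Pooled phone error rate over (reference, hypothesis) pronunciation pairs.

    Assumption: errors and reference lengths are summed over all words
    before dividing; a per-word average would be another valid reading.
    """
    pairs = list(pairs)
    total_errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total_ref_phones = sum(len(ref) for ref, _ in pairs)
    if total_ref_phones == 0:
        raise ValueError("reference pronunciations contain no phones")
    return total_errors / total_ref_phones


# Hypothetical phone strings, not taken from the paper's dictionary.
example_pairs = [
    (["k", "a"], ["k", "a"]),            # exact match, 0 errors
    (["b", "a", "b"], ["b", "a", "d"]),  # one substitution
]
per = phone_error_rate(example_pairs)
```

With these two hypothetical words, one error over five reference phones gives a rate of 0.2.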

Bibliographic reference. Nallasamy, Udhyakumar / Garbus, Michael / Metze, Florian / Jin, Qin / Schaaf, Thomas / Schultz, Tanja (2011): "Analysis of dialectal influence in pan-Arabic ASR", In INTERSPEECH-2011, 1721-1724.