Speech and Language Technology in Education (SLaTE 2013)
English is the de facto language of global communication. Because of the influence of speakers' mother tongues, however, speakers from different regions often pronounce English with different accents. The ultimate goal of our project is the automatic creation of a global pronunciation map of World Englishes on an individual-speaker basis, which speakers can use to locate similar English pronunciations. Mathematically, creating the map requires a matrix of pronunciation distances among all the speakers considered. Our previous study proposed an algorithm for that purpose: using reference pronunciation distances calculated from labeled data, a pronunciation distance predictor was trained for use on unlabeled data. Due to space limitations in that study, the procedure for calculating the reference distances was not described in detail. This paper gives a detailed description of that procedure and clusters 498 native and non-native speakers from around the world in the Speech Accent Archive using the reference distances. The results show high accentual validity of the reference inter-speaker distances.
Index Terms: World Englishes, IPA transcription, DTW, Speech Accent Archive, phonetic pronunciation clustering
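As a rough illustration of the kind of computation the abstract refers to, the following Python sketch aligns two IPA transcriptions with DTW and fills a symmetric inter-speaker distance matrix. It is a minimal sketch under stated assumptions, not the procedure from the paper: the 0/1 substitution cost `unit_cost`, the function names, and the toy transcriptions are all hypothetical, and the paper's actual phone-to-phone cost is not given in this abstract.

```python
from typing import Callable, List, Sequence

def unit_cost(a: str, b: str) -> float:
    """Assumed local cost: 0 for identical IPA symbols, 1 otherwise."""
    return 0.0 if a == b else 1.0

def dtw_distance(x: Sequence[str], y: Sequence[str],
                 cost: Callable[[str, str], float] = unit_cost) -> float:
    """Accumulated cost of the best DTW alignment between two symbol sequences."""
    n, m = len(x), len(y)
    inf = float("inf")
    # d[i][j] = best accumulated cost aligning x[:i] with y[:j]
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            c = cost(x[i - 1], y[j - 1])
            d[i][j] = c + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]

def distance_matrix(transcripts: Sequence[Sequence[str]]) -> List[List[float]]:
    """Symmetric matrix of pairwise DTW distances, one row per speaker."""
    k = len(transcripts)
    mat = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            mat[i][j] = mat[j][i] = dtw_distance(transcripts[i], transcripts[j])
    return mat

# Toy transcriptions of one word by three hypothetical speakers
# (illustrative only; not taken from the Speech Accent Archive).
speakers = [
    ["p", "l", "i", "z"],
    ["p", "l", "i", "s"],
    ["p", "ɹ", "i", "z"],
]
matrix = distance_matrix(speakers)
```

A matrix of this shape could then be handed to a standard hierarchical clustering routine. In practice, a graded phone-to-phone cost would likely replace the 0/1 cost assumed here.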
Bibliographic reference. Shen, Han-Ping / Minematsu, Nobuaki / Makino, Takehiko / Weinberger, Steven H. / Pongkittiphan, Teeraphon / Wu, Chung-Hsien (2013): "Speaker-based accented English clustering using a world English archive", In SLaTE-2013, 184-188.