Speech and Language Technology in Education (SLaTE 2013)

Grenoble, France
August 30-September 1, 2013

Speaker-based Accented English Clustering Using a World English Archive

Han-Ping Shen (1,2), Nobuaki Minematsu (2), Takehiko Makino (3), Steven H. Weinberger (4), Teeraphon Pongkittiphan (2), Chung-Hsien Wu (1)

(1) National Cheng Kung University, Tainan, Taiwan
(2) The University of Tokyo, Tokyo, Japan
(3) Chuo University, Tokyo, Japan
(4) George Mason University, Virginia, USA

English is the only language available for global communication. Due to the influence of speakers' mother tongues, however, speakers from different regions often have different accents in their pronunciation of English. The ultimate goal of our project is the automatic creation of a global pronunciation map of World Englishes on an individual basis, which speakers can use to locate similar English pronunciations. Creating the map mathematically requires a matrix of pronunciation distances among all the speakers considered. Our previous study proposed an algorithm for that purpose [1]: using reference pronunciation distances calculated from labeled data, a pronunciation distance predictor was trained for unlabeled data. Due to space limitations in [1], the procedure for calculating the reference distances was not described in detail. In this paper, detailed descriptions are given, and 498 native and non-native speakers from around the world in the Speech Accent Archive are clustered using the reference distances. Results show high accentual validity of the reference inter-speaker distances.


  1. H.-P. Shen, N. Minematsu, S. H. Weinberger, T. Makino, J. Novak, T. Pongkittiphan, C.-H. Wu, "Speaker-based pronunciation clustering of World Englishes based on pronunciation structure analysis," IEICE Technical Report, SP2012-116, pp. 7-12 (2013-2)

Index Terms: World Englishes, IPA transcription, DTW, Speech Accent Archive, phonetic pronunciation clustering
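
As a rough illustration of the inter-speaker distance matrix mentioned in the abstract, the following is a minimal sketch, assuming each speaker is represented by a sequence of IPA symbols and that a DTW alignment cost with a 0/1 substitution penalty stands in for the reference pronunciation distance. The abstract does not give the actual phone-to-phone costs or the procedure itself, so the cost function, the names `dtw_distance` and `distance_matrix`, and the toy transcriptions are all hypothetical.

```python
from typing import Dict, List, Sequence


def dtw_distance(a: Sequence[str], b: Sequence[str]) -> float:
    """DTW alignment cost between two phone sequences.

    Assumption: the local cost is 0 for identical symbols and 1 otherwise;
    a real system would likely use a phonetically informed cost.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = 0.0 if a[i - 1] == b[j - 1] else 1.0
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] also aligned with an earlier b symbol
                cost[i][j - 1],      # b[j-1] also aligned with an earlier a symbol
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


def distance_matrix(speakers: Dict[str, List[str]]) -> List[List[float]]:
    """Pairwise DTW distances, with rows and columns in sorted speaker order."""
    names = sorted(speakers)
    return [[dtw_distance(speakers[p], speakers[q]) for q in names] for p in names]


# Hypothetical transcriptions of one word by three speakers.
toy = {
    "spk_a": ["p", "l", "i", "z"],
    "spk_b": ["p", "l", "i", "s"],
    "spk_c": ["p", "ɯ", "ɾ", "i", "z", "ɯ"],
}
matrix = distance_matrix(toy)
```

A symmetric matrix of this kind, with a zero diagonal, is the sort of input a standard hierarchical clustering routine could take to group speakers by accent.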

