ISCA Archive Interspeech 2013

Quantifying cross-linguistic variation in grapheme-to-phoneme mapping

Martine Coene, Annemiek Hammer, Wojtek Kowalczyk, Louis ten Bosch, Bart Vaerenberg, Paul J. Govaerts

In the literature, languages have been identified as having more or less transparent orthographies, depending on how predictable their spelling-to-sound correspondences are. Quantitative measures that are based on large-scale language corpora and capable of objectively assessing such cross-linguistic variation are rather scarce. The quantitative assessment method presented here builds on the correlation between distances of phonemic and graphemic frequency distributions of a given sample and similar distances obtained from large corpora of the same language. The metric itself may be used as a research tool to investigate the potential effect of orthographic transparency on the development and performance of reading in different populations.
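The abstract does not spell out the computation, so the following is only a minimal sketch of one possible reading, not the authors' implementation. It assumes that each sample and the reference corpus are available as parallel sequences of grapheme and phoneme symbols, that the distance between two frequency distributions is Euclidean, and that the correlation is Pearson's r taken across several samples. The function names (`rel_freq`, `euclidean`, `pearson`, `distance_correlation`) are hypothetical and chosen for illustration.

```python
from collections import Counter
from math import sqrt


def rel_freq(symbols):
    """Relative frequency distribution of a sequence of symbols."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return {s: c / total for s, c in counts.items()}


def euclidean(p, q):
    """Euclidean distance between two distributions (assumed distance measure)."""
    keys = set(p) | set(q)
    return sqrt(sum((p.get(k, 0.0) - q.get(k, 0.0)) ** 2 for k in keys))


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long lists of numbers."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)


def distance_correlation(samples, corpus):
    """Correlate sample-to-corpus distances on the grapheme and phoneme side.

    samples: list of (graphemes, phonemes) pairs, each a sequence of symbols.
    corpus:  one (graphemes, phonemes) pair standing in for a large corpus.
    """
    g_ref, p_ref = rel_freq(corpus[0]), rel_freq(corpus[1])
    g_dist = [euclidean(rel_freq(g), g_ref) for g, _ in samples]
    p_dist = [euclidean(rel_freq(p), p_ref) for _, p in samples]
    return pearson(g_dist, p_dist)


# Toy data (invented for illustration): a fully transparent "language" in
# which every grapheme maps to exactly one phoneme, written in upper case.
corpus_g = list("abcabcab")
corpus = (corpus_g, [g.upper() for g in corpus_g])
samples = [(list(w), [c.upper() for c in w]) for w in ("aab", "abc", "ccc")]
score = distance_correlation(samples, corpus)
```

Under these assumptions, a one-to-one grapheme-to-phoneme mapping makes the two distance lists identical, so the correlation is at its maximum; less predictable mappings would be expected to pull it down.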

doi: 10.21437/Interspeech.2013-456

Cite as: Coene, M., Hammer, A., Kowalczyk, W., ten Bosch, L., Vaerenberg, B., Govaerts, P.J. (2013) Quantifying cross-linguistic variation in grapheme-to-phoneme mapping. Proc. Interspeech 2013, 1854-1857, doi: 10.21437/Interspeech.2013-456

  author={Martine Coene and Annemiek Hammer and Wojtek Kowalczyk and Louis ten Bosch and Bart Vaerenberg and Paul J. Govaerts},
  title={{Quantifying cross-linguistic variation in grapheme-to-phoneme mapping}},
  booktitle={Proc. Interspeech 2013},