Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Sabrina Stehwien, Ngoc Thang Vu, Antje Schweitzer


Pitch accent detection often makes use of both acoustic and lexical features based on the fact that pitch accents tend to correlate with certain words. In this paper, we extend a pitch accent detector that involves a convolutional neural network to include word embeddings, which are state-of-the-art vector representations of words. We examine the effect these features have on within-corpus and cross-corpus experiments on three English datasets. The results show that while word embeddings can improve the performance in corpus-dependent experiments, they also have the potential to make generalization to unseen data more challenging.


 DOI: 10.21437/SpeechProsody.2018-146

Cite as: Stehwien, S., Vu, N.T., Schweitzer, A. (2018) Effects of Word Embeddings on Neural Network-based Pitch Accent Detection. Proc. 9th International Conference on Speech Prosody 2018, 719-723, DOI: 10.21437/SpeechProsody.2018-146.


@inproceedings{Stehwien2018,
  author={Sabrina Stehwien and Ngoc Thang Vu and Antje Schweitzer},
  title={Effects of Word Embeddings on Neural Network-based Pitch Accent Detection},
  year=2018,
  booktitle={Proc. 9th International Conference on Speech Prosody 2018},
  pages={719--723},
  doi={10.21437/SpeechProsody.2018-146},
  url={http://dx.doi.org/10.21437/SpeechProsody.2018-146}
}