Speech, Prosody, and Machines: Nine Challenges for Prosody Research

Andrew Rosenberg


Speech technology is becoming commonplace. Traditional telephony based interactive voice systems have been joined by virtual assistants and navigation systems to create a broad ecosystem of voice enabled technologies. Prosody is an essential component to human communication, but machines still lag in their ability to understand information communicated prosodically and to produce human-like intonation. This talk poses nine challenges designed to effectively and more thoroughly integrate prosody into current speech technologies. These include long-standing and contemporary concerns surrounding the availability and utility of data, gaps in linguistic theory and specific technological issues. Each of these challenges have received some attention, additional work is necessary to bring the role of prosody in speech technology closer to its role in human communication.


 DOI: 10.21437/SpeechProsody.2018-159

Cite as: Rosenberg, A. (2018) Speech, Prosody, and Machines: Nine Challenges for Prosody Research. Proc. 9th International Conference on Speech Prosody 2018, 784-793, DOI: 10.21437/SpeechProsody.2018-159.


@inproceedings{Rosenberg2018,
  author={Andrew Rosenberg},
  title={Speech, Prosody, and Machines: Nine Challenges for Prosody Research},
  year=2018,
  booktitle={Proc. 9th International Conference on Speech Prosody 2018},
  pages={784--793},
  doi={10.21437/SpeechProsody.2018-159},
  url={http://dx.doi.org/10.21437/SpeechProsody.2018-159}
}