ISCA Archive Interspeech 2013

Articulatory copy synthesis from cine x-ray films

Yves Laprie, Matthieu Loosvelt, Shinji Maeda, Rudolph Sock, Fabrice Hirsch

This paper deals with articulatory copy synthesis from X-ray films. The underlying articulatory synthesizer runs an aerodynamic and an acoustic simulation, taking as input target area functions, F0, and transition patterns from one area function to the next. The articulators, the tongue in particular, have been delineated by hand or semi-automatically from the X-ray films. Particular attention has been paid to determining the centerline of the vocal tract from the image and to the coordination between glottal area and vocal tract constrictions, since both aspects strongly affect the acoustics. Experiments show that good-quality speech can be resynthesized even when the interval between two images is 40 ms. The same approach could easily be applied to cine MRI data.
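To make the idea of transition patterns between target area functions concrete, here is a minimal sketch that fills the 40 ms gap between successive targets with intermediate area functions. It is an illustration only: the abstract does not state which transition patterns the synthesizer uses, so linear interpolation is an assumption, and the function name `interpolate_area_functions`, the 5 ms output step, and the sample area values are all hypothetical.

```python
def interpolate_area_functions(targets, frame_interval_s=0.040, step_s=0.005):
    """Linearly interpolate between successive target area functions.

    targets: list of area functions, one per X-ray image; each is a list of
             cross-sectional areas along the vocal tract (same length each).
    frame_interval_s: time between two images (40 ms in the abstract).
    step_s: hypothetical time step of the interpolated output.

    Assumption: linear transitions; the paper's actual patterns may differ.
    """
    if len(targets) < 2:
        raise ValueError("need at least two target area functions")
    steps_per_frame = round(frame_interval_s / step_s)
    frames = []
    for start, end in zip(targets, targets[1:]):
        for i in range(steps_per_frame):
            w = i / steps_per_frame  # 0 at the start target, approaches 1
            frames.append([(1 - w) * a + w * b for a, b in zip(start, end)])
    frames.append(list(targets[-1]))  # close on the final target exactly
    return frames


# Two made-up target area functions with two tube sections each.
frames = interpolate_area_functions([[1.0, 2.0], [3.0, 0.0]])
```

With the default settings, each 40 ms interval is divided into eight 5 ms steps, so two targets yield nine area functions, the first and last being the targets themselves.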

doi: 10.21437/Interspeech.2013-480

Cite as: Laprie, Y., Loosvelt, M., Maeda, S., Sock, R., Hirsch, F. (2013) Articulatory copy synthesis from cine x-ray films. Proc. Interspeech 2013, 2024-2028, doi: 10.21437/Interspeech.2013-480

  author={Yves Laprie and Matthieu Loosvelt and Shinji Maeda and Rudolph Sock and Fabrice Hirsch},
  title={{Articulatory copy synthesis from cine x-ray films}},
  booktitle={Proc. Interspeech 2013},