Jerk Minimization for Acoustic-To-Articulatory Inversion

Avni Rajpal, Hemant A. Patil


Effortless speech production in humans requires coordinated movements of articulators such as the lips, tongue, jaw, and velum. Consequently, measured articulatory trajectories are smooth and slowly varying. However, trajectories estimated by acoustic-to-articulatory inversion (AAI) are found to be jagged. Energy minimization is therefore used as a smoothness constraint to improve the performance of AAI. Besides energy minimization, jerk (i.e., the rate of change of acceleration) is known to quantify smoothness in human motor movements. Human motor control is organized to achieve an intended goal with the smoothest possible movement, under the constraint of minimum accelerative transients. In this paper, we propose jerk minimization as an alternative smoothness criterion for frame-based acoustic-to-articulatory inversion. The resulting trajectories are smooth in the sense that, for an articulator-specific window size, they have minimum jerk. The results obtained with this criterion were found to be comparable with those of inversion schemes that use existing energy minimization criteria to achieve smoothness.
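To make the notion of jerk as a smoothness measure concrete, the following minimal sketch estimates jerk with a third-order finite difference and sums its squares over a trajectory. It is an illustration only, not the inversion scheme proposed in the paper: the function names `third_difference` and `jerk_cost`, the unit sampling interval, and the toy trajectories are all assumptions introduced here.

```python
def third_difference(x, dt=1.0):
    """Finite-difference estimate of jerk (third time derivative) of samples x."""
    return [
        (x[i + 3] - 3 * x[i + 2] + 3 * x[i + 1] - x[i]) / dt ** 3
        for i in range(len(x) - 3)
    ]


def jerk_cost(x, dt=1.0):
    """Sum of squared jerk over the trajectory; lower means smoother."""
    return sum(j * j for j in third_difference(x, dt))


# Hypothetical articulator positions (not data from the paper):
# a constant-acceleration path and a jagged copy with alternating offsets.
smooth = [0.5 * t * t for t in range(8)]
jagged = [v + (0.3 if t % 2 else -0.3) for t, v in enumerate(smooth)]

print(jerk_cost(smooth))  # constant acceleration, so the jerk cost is zero
print(jerk_cost(jagged))  # frame-to-frame jitter gives a much larger cost
```

Under these assumptions, a jagged estimate of the kind produced by frame-based inversion is penalized heavily, while a slowly varying trajectory incurs little or no cost, which is the property a jerk-based smoothness criterion relies on.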


DOI: 10.21437/SSW.2016-14

Cite as

Rajpal, A., Patil, H.A. (2016) Jerk Minimization for Acoustic-To-Articulatory Inversion. Proc. 9th ISCA Speech Synthesis Workshop, 82-87.

Bibtex
@inproceedings{Rajpal+2016,
author={Avni Rajpal and Hemant A. Patil},
title={Jerk Minimization for Acoustic-To-Articulatory Inversion},
year=2016,
booktitle={9th ISCA Speech Synthesis Workshop},
doi={10.21437/SSW.2016-14},
url={http://dx.doi.org/10.21437/SSW.2016-14},
pages={82--87}
}