INTERSPEECH 2006 - ICSLP
This paper proposes a trajectory model which is based on a mixture density network trained with target features augmented with dynamic features together with an algorithm for estimating maximum likelihood trajectories which respects constraints between the static and derived dynamic features. This model was evaluated on an inversion mapping task. We found the introduction of the trajectory model successfully reduced root mean square error by up to 7.5%, as well as increasing correlation scores.
Bibliographic reference. Richmond, Korin (2006): "A trajectory mixture density network for the acoustic-articulatory inversion mapping", In INTERSPEECH-2006, paper 1790-Mon3WeS.4.