A Stochastic Model for Computer-Aided Human-Human Dialogue

Merwan Barlier, Romain Laroche, Olivier Pietquin

In this paper we introduce a novel model for computer-aided human-human dialogue. In this context, the computer aims at improving the outcome of a human-human task-oriented dialogue by intervening during the course of the interaction. While dialogue state and topic tracking in human-human dialogue have already been studied, few work has been devoted to the sequential part of the problem, where the impact of the system’s actions on the future of the conversation is taken into account. This paper addresses this issue by first modelling human-human dialogue as a Markov Reward Process. The task of purposely taking part into the conversation is then optimised within the Linearly Solvable Markov Decision Process framework. Utterances of the Conversational Agent are seen as perturbations in this process, which aim at satisfying the user’s long-term goals while keeping the conversation natural. Finally, results obtained by simulation suggest that such an approach is suitable for computer-aided human-human dialogue and is a first step towards three-party dialogue.

DOI: 10.21437/Interspeech.2016-479

Cite as

Barlier, M., Laroche, R., Pietquin, O. (2016) A Stochastic Model for Computer-Aided Human-Human Dialogue. Proc. Interspeech 2016, 2051-2055.

author={Merwan Barlier and Romain Laroche and Olivier Pietquin},
title={A Stochastic Model for Computer-Aided Human-Human Dialogue},
booktitle={Interspeech 2016},