Online learning and transfer for user adaptation in dialogue systems

Carrara Nicolas, Romain Laroche, Olivier Pietquin


We address the problem of user adaptation in Spoken Dialogue Systems. The goal is to quickly adapt online to a new user given a large amount of dialogues collected with other users. Previous works using Transfer for Reinforcement Learning tackled this problem when the number of source users remains limited. In this paper, we overcome this constraint by clustering the source users: each user cluster, represented by its centroid, is used as a potential source in the state-of-the-art Transfer Reinforcement Learning algorithm. Our benchmark compares several clustering approaches, including one based on a novel metric. All experiments are led on a negotiation dialogue task, and their results show significant improvements over baselines.


 DOI: 10.21437/SemDial.2017-15

Cite as: Nicolas, C., Laroche, R., Pietquin, O. (2017) Online learning and transfer for user adaptation in dialogue systems. Proc. SEMDIAL 2017 (SaarDial) Workshop on the Semantics and Pragmatics of Dialogue, 137-145, DOI: 10.21437/SemDial.2017-15.


@inproceedings{Nicolas2017,
  author={Carrara Nicolas and Romain Laroche and Olivier Pietquin},
  title={Online learning and transfer for user adaptation in dialogue systems},
  year=2017,
  booktitle={Proc. SEMDIAL 2017 (SaarDial) Workshop on the Semantics and Pragmatics of Dialogue},
  pages={137--145},
  doi={10.21437/SemDial.2017-15},
  url={http://dx.doi.org/10.21437/SemDial.2017-15}
}