Adaptive Generation in Dialogue Systems using Dynamic User modeling

Authors

  • Srinivasan Janarthanam Heriot Watt University, Edinburgh
  • Oliver Lemon Heriot Watt University, Edinburgh

Abstract

We address the problem of dynamically modeling and adapting to an unknown user in resource-scarcedomains in the context of interactive spoken dialogue systems. As an example, we show how a system couldchoose referring expressions to refer to domain entities for users with different levels of domain expertise,whose domain knowledge is initially unknown to the system. We approach this problem using a three stepprocess: collecting data using a Wizard of Oz method, building a user simulation, and learning to modeland adapt to users using Reinforcement Learning techniques in Markov Decision Processes (MDP). We show that by using a small corpus of non-adaptive dialogues it is possible to learn an adaptive usermodeling policy in resource-scarce domains using a sense-predict-adapt approach. Our evaluation resultsshow that the learned user modeling and adaptation strategies performed better in terms of adaptationthan hand-coded baseline policies on both simulated and real users. With real users, the learned policyproduced around a 20% increase in adaptation in comparison to the best performing hand-coded adaptivebaseline. We also show that adaptation to user’s domain knowledge results in improving task success(99.47% for the learned policy vs 84.7% for the hand-coded baseline) and reducing dialogue time of theconversation (11% relative difference). Users reported that it was easier to identify domain objects whenthe system used adaptive referring expressions during the conversations. We believe that this method canbe extended to other levels of adaptation such as content selection and in other domains where adaptingto users’ domain knowledge is useful, such as travel and healthcare.

Author Biographies

  • Srinivasan Janarthanam, Heriot Watt University, Edinburgh
    Srinivasan Janarthanam is currently a research associate at the Interaction Lab, Heriot Watt University at Edinburgh. His research interests include dialogue management, natural language generation and user modelling in the context of spoken dialogue systems and is currently involved in the SPACEBOOK and HELP4MOOD projects. Previously, he worked on the CLASSIC project at the University of Edinburgh where he also received his Ph.D in 2011 for his dissertation on dynamic user modelling in dialogue systems using reinforcement learning methods. He was a UKIERI (2007-10) Ph.D. scholar funded by the British Council. Before his Ph.D, he worked as a research associate in Amrita University, India and as an applications developer in iNautix Technologies, India. He has a Masters in Intelligent Systems from the University of Sussex, UK and was a Commonwealth Scholar during this period. He did his undergraduate degree in Computer Science and Engineering from Bharathiyar University, India.
  • Oliver Lemon, Heriot Watt University, Edinburgh
    Oliver Lemon is a Professor in the School of Mathematical and Computer Sciences at Heriot-Watt University, Edinburgh,  where he is head of the Interaction Lab. His research interests span  machine learning, speech and language technology, Human-Robot Interaction, and Technology Enhanced Learning.  He previously worked at Stanford University (2000-2002) and at the School of Informatics, Edinburgh University (2003-2009), and he received a PhD from the Centre for Cognitive Science, Edinburgh University in 1996. He was Scientific Coordinator of the European FP6 project “TALK” on in-car speech interfaces, and Coordinator of the FP7 project ”CLASSiC” (www.classic-project.org) which advanced machine learning approaches to spoken dialogue technology. His recent research focuses on the use of machine  learning methods  in interactive systems, including adaptive natural language generation. He has co-authored over 100 peer-reviewed papers and is co-author of the book "Reinforcement Learning for Adaptive Dialogue Systems" (Springer, 2011),  and co-editor of "Data-Driven Methods for Adaptive Spoken Dialogue Systems" (Springer 2012). He is an associate editor for two ACM journals; Transactions on Speech and Language Processing and Transactions on Interactive Intelligent Systems.


Published

2024-12-05

Issue

Section

Long paper