Show Us the Way: Learning to Manage Dialog from Demonstrations

التفاصيل البيبلوغرافية
العنوان: Show Us the Way: Learning to Manage Dialog from Demonstrations
المؤلفون: Gordon-Hall, Gabriel, Gorinski, Philip John, Lampouras, Gerasimos, Iacobacci, Ignacio
سنة النشر: 2020
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
الوصف: We present our submission to the End-to-End Multi-Domain Dialog Challenge Track of the Eighth Dialog System Technology Challenge. Our proposed dialog system adopts a pipeline architecture, with distinct components for Natural Language Understanding, Dialog State Tracking, Dialog Management and Natural Language Generation. At the core of our system is a reinforcement learning algorithm which uses Deep Q-learning from Demonstrations to learn a dialog policy with the help of expert examples. We find that demonstrations are essential to training an accurate dialog policy where both state and action spaces are large. Evaluation of our Dialog Management component shows that our approach is effective - beating supervised and reinforcement learning baselines.
Comment: 8 pages + 2 pages references, 4 figures, 4 tables, accepted to DSTC8 Workshop at AAAI2020
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2004.08114
رقم الأكسشن: edsarx.2004.08114
قاعدة البيانات: arXiv