GPT建筑和目标国家跟踪强化多面对话系统的生成用户模拟模拟器 (A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems)

Building user simulators (USs) for reinforcement learning (RL) of task-oriented dialog systems (DSs) has gained more and more attention, which, however, still faces several fundamental challenges. First, it is unclear whether we can leverage pretrained language models to design, for example, GPT-2 based USs, to catch up and interact with the recently advanced GPT-2 based DSs. Second, an important ingredient in a US is that the user goal can be effectively incorporated and tracked; but how to flexibly integrate goal state tracking and develop an end-to-end trainable US for multi-domains has remained to be a challenge. In this work, we propose a generative user simulator (GUS) with GPT-2 based architecture and goal state tracking towards addressing the above two challenges. Extensive experiments are conducted on MultiWOZ2.1. Different DSs are trained via RL with GUS, the classic agenda-based user simulator (ABUS) and other ablation simulators respectively, and are compared for cross-model evaluation, corpus-based evaluation and human evaluation. The GUS achieves superior results in all three evaluation tasks.

翻译：建立用户模拟器(US)以加强面向任务的对话系统(DS)的学习,这一点越来越受到越来越多的关注,然而,仍然面临着一些根本性的挑战。首先,我们是否能够利用预先训练的语言模型来设计,例如,以美国为基地的GPT-2系统,以便赶上最近先进的基于GPT-2系统的DS并与之互动。第二,美国的一个重要成份是用户目标可以有效纳入并跟踪;但是,如何灵活整合目标状态跟踪和开发一个可用于多域的端到端训练的美国仍是一个挑战。在这项工作中,我们提议用基于GPT-2的架构和目标状态跟踪来配制一个基因化用户模拟器(GUS),以应对上述两项挑战。在MultiWOZ2.1上进行了广泛的实验。不同的DS通过RL与GUS、经典基于议程的用户模拟器(ABUS)和其他模拟器(ABUS)培训,并分别将所有三项评价任务进行比较。

相关内容

DSS

关注 472

决策支持系统（Decision Support Systems）期刊中发表的文章的共同主线是它们与支持增强决策制定的理论和技术问题的相关性。所涉及的领域可能包括基础、功能、接口、实现、影响和决策支持系统(DSS)的评估。手稿可以从不同的方法和方法学中获得，包括决策理论、经济学、计量经济学、统计学、计算机支持的协作工作、数据库管理、语言学、管理科学、数学建模、运营管理、认知科学、心理学、用户界面管理等。但是，一份侧重于对任何这些相关领域的直接贡献的手稿应提交给适合于特定领域的机构。官网地址：http://dblp.uni-trier.de/db/journals/dss/

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日