Recent advances in large-scale language modeling and generation have enabled the creation of dialogue agents that exhibit human-like responses in a wide range of conversational scenarios spanning a diverse set of tasks, from general chit-chat to focused goal-oriented discourse. While these agents excel at generating high-quality responses that are relevant to prior context, they suffer from a lack of awareness of the overall direction in which the conversation is headed, and the likelihood of task success inherent therein. Thus, we propose a framework in which dialogue agents can evaluate the progression of a conversation toward or away from desired outcomes, and use this signal to inform planning for subsequent responses. Our framework is composed of three key elements: (1) the notion of a "global" dialogue state (GDS) space, (2) a task-specific progression function (PF) computed in terms of a conversation's trajectory through this space, and (3) a planning mechanism based on dialogue rollouts by which an agent may use progression signals to select its next response.
翻译:在大规模语言建模和生成方面最近取得的进展使得对话机构得以创建,在涉及从一般聊天到有重点的面向目标的讨论等一系列不同任务的广泛对话情景中表现出人性化反应的对话机构。这些机构在产生与先前背景相关的高质量反应方面表现突出,但缺乏对对话所处总方向以及其中所固有的任务成功可能性的认识。因此,我们提议了一个对话机构可据以评价对话进展的框架,用以评价对话进展与预期结果之间的距离,并利用这一信号为今后反应的规划提供信息。我们的框架由三个关键要素组成:(1)“全球”对话状态空间的概念,(2)根据在这一空间的对话轨迹计算的具体任务进展功能,以及(3)基于对话展开的规划机制,使一个机构可以使用进步信号选择其下一个反应。