We have a Christmas gift for Harry Potter fans all over the world. In this paper, we present Harry Potter Dialogue (HPD), a dataset that helps train Harry Potter-like dialogue agents. Such a task is typically viewed as a variant of personalized dialogue agents, but they differ significantly in three respects: 1) Harry lived in a virtual world of wizards, thus, real-world commonsense may not apply to Harry's conversations; 2) Harry's behavior is strongly linked to background information in conversations: the scene, its attributes and its relationship to other speakers; and 3) Such backgrounds are dynamically altered as the storyline goes on. The HPD dataset, as the first dataset to facilitate the study of dialogue agent construction for characters within a story, provides rich contextual information about each dialogue session such as scenes, character attributes, and relations. More importantly, all the background information will change over the course of the story. In addition, HPD could support both dialogue generation and retrieval tasks. We evaluate baselines such as Dialog-GPT and BOB to determine the extent to which they can generate Harry Potter-like responses. The experimental results disappoint us in that although the generated responses are fluent, they still seem out of character for Harry. Besides, we validate the current most robust dialogue agent, ChatGPT, which also can't generate plausible Harry-Potter-like responses in some cases, either. Our results suggest that there is much scope for future research.
翻译:我们有一个圣诞礼物给世界各地的哈利波特球迷。在这份文件中,我们展示了哈利波特对话(HPD)这个有助于训练哈利波特式对话代理人的数据集(HPD),这是一个帮助训练哈利波特式对话代理人的数据集。这一任务通常被视为个性化对话代理人的变体,但从三个方面看,任务差异很大:(1) 哈利生活在一个巫师的虚拟世界,因此,现实世界常识可能不适用于哈里的谈话;(2) 哈利的行为与对话中的背景资料紧密相连:场景、特征和与其他发言者的关系;和(3) 这些背景随着故事线的继续而变化。HPD数据集,作为第一个促进研究对话代理人构建故事中人物的数据集,为每场对话会提供丰富的背景信息,例如场景、性格属性和关系。更重要的是,所有背景信息都不会在故事过程中发生改变。此外,HPDD可以支持对话的生成和检索任务。我们评估了像Diallog-GPT和BOBOB这样的基线,以确定它们能够产生哈里波特式反应的程度。实验性结果让我们失望,因为除了透明化的特性之外,我们对当前对话的判断性能产生更强烈的反应。