Despite their widespread adoption, neural conversation models have yet to exhibit natural chat capabilities with humans. In this research, we examine user utterances as causes and generated responses as effects, recognizing that changes in a cause should produce a different effect. To further explore this concept, we have compiled and expanded upon a new dataset called CausalDialogue through crowd-sourcing. This dataset includes multiple cause-effect pairs within a directed acyclic graph (DAG) structure. Our analysis reveals that traditional loss functions can struggle to effectively incorporate the DAG structure, leading us to propose a causality-enhanced method called Exponential Maximum Average Treatment Effect (ExMATE) to enhance the impact of causality at the utterance level in training neural conversation models. To evaluate the effectiveness of this approach, we have built a comprehensive benchmark using the CausalDialogue dataset leveraging large-scale pre-trained language models, and have assessed the results through both human and automatic evaluation metrics for coherence, diversity, and agility. Our findings show that current techniques are still unable to effectively address conversational DAGs, and that the ExMATE method can improve the diversity and agility of conventional loss functions while maintaining coherence.
翻译:尽管广泛采用,神经对话模式尚未表现出与人类的自然聊天能力,但神经对话模式尚未表现出与人类的自然聊天能力。 在这项研究中,我们将用户的言辞作为原因进行检查,并产生反应作为效果,同时认识到一个原因的变化应产生不同的效果。为了进一步探讨这一概念,我们通过众包,汇编并扩展了名为CausalDilogue的新数据集。这个数据集包括了定向自行车图结构中多重因果关系。我们的分析表明,传统的损失功能可能难以有效地纳入DAG结构,导致我们提出一种因果强化的方法,称为 " ExMATE(ExMATE) ",以在培训神经谈话模式中增加语因果的影响。为了评估这一方法的有效性,我们利用CausalDilogue数据集,利用大型的预先培训语言模型(DAGAG)来评估结果。我们的分析显示,传统的损失功能可以通过人与自动评估衡量标准来有效纳入DAG结构,从而导致我们提出一种强化因果性的方法,称为 " ExMATE " (ExMATE) " (ExMATE) " Excience) " (Exmin) " laview dality dolview dislity dislity dislity) comduction) laction and the sal dislence and dislity dislational dislation.