We present a new method based on episodic Knowledge Graphs (eKGs) for evaluating (multimodal) conversational agents in open domains. This graph is generated by interpreting raw signals during conversation and is able to capture the accumulation of knowledge over time. We apply structural and semantic analysis of the resulting graphs and translate the properties into qualitative measures. We compare these measures with existing automatic and manual evaluation metrics commonly used for conversational agents. Our results show that our Knowledge-Graph-based evaluation provides more qualitative insights into interaction and the agent's behavior.
翻译:我们提出了一个基于偶发知识图(eKGs)的新方法,用于在开放域内评价(多式)对话媒介。该图来自在对话期间对原始信号的解释,能够捕捉知识的积累时间。我们对所产生的图表进行结构和语义分析,并将属性转化为定性计量。我们将这些措施与现有用于对话媒介的自动和人工评价指标进行比较。我们的结果表明,基于知识格的评价工作对互动和代理人的行为提供了更高质量的了解。