Dialogue agents that interact with humans in situated environments need to manage referential ambiguity across multiple modalities and ask for help as needed. However, it is not clear what kinds of questions such agents should ask or how the answers to such questions can be used to resolve ambiguity. To address this, we analyzed dialogue data from an interactive study in which participants controlled a virtual robot tasked with organizing a set of tools while engaging in dialogue with a live, remote experimenter. We report several novel findings, including the distribution of question types used to resolve ambiguity and the influence of dialogue-level factors on the reference resolution process. Based on these empirical findings, we (1) developed a computational model for clarification requests using a decision network with an entropy-based utility assignment method that operates across modalities, (2) evaluated the model, showing that it outperforms a slot-filling baseline in environments of varying ambiguity, and (3) interpreted the results to offer insight into how agents can ask questions to facilitate situated reference resolution.
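To make the idea of an entropy-based utility for clarification questions concrete, the following is a minimal sketch, not the model described in the abstract. It assumes a belief distribution over candidate referents and scores each possible attribute question by its expected reduction in Shannon entropy, assuming the answer is determined by the intended referent. The function names, the attribute names (`color`, `location`), and the example objects are all hypothetical, and the decision-network structure itself is not modeled here.

```python
import math
from collections import defaultdict


def entropy(probs):
    """Shannon entropy (in bits) of a discrete distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def expected_information_gain(belief, objects, attribute):
    """Expected entropy reduction from asking about one attribute.

    belief:  dict mapping object id -> probability of being the referent
    objects: dict mapping object id -> dict of attribute values
    Assumption: the answer is fully determined by the intended referent.
    """
    # Group probability mass by the answer each candidate would produce.
    mass_by_answer = defaultdict(list)
    for obj_id, p in belief.items():
        mass_by_answer[objects[obj_id][attribute]].append(p)

    # Average the posterior entropy over the possible answers.
    expected_posterior = 0.0
    for masses in mass_by_answer.values():
        p_answer = sum(masses)
        if p_answer > 0:
            posterior = [m / p_answer for m in masses]
            expected_posterior += p_answer * entropy(posterior)

    return entropy(belief.values()) - expected_posterior


def best_question(belief, objects, attributes):
    """Pick the attribute whose question has the highest expected gain."""
    return max(
        attributes,
        key=lambda a: expected_information_gain(belief, objects, a),
    )


# Hypothetical scene: four wrenches, all equally likely to be the referent.
objects = {
    "wrench_1": {"color": "red", "location": "table"},
    "wrench_2": {"color": "blue", "location": "table"},
    "wrench_3": {"color": "red", "location": "shelf"},
    "wrench_4": {"color": "green", "location": "table"},
}
belief = {obj_id: 0.25 for obj_id in objects}

chosen = best_question(belief, objects, ["color", "location"])
```

In this hypothetical scene, asking about color splits the candidates more evenly than asking about location, so it yields the larger expected entropy reduction. A fuller treatment would also weigh the cost of asking and combine evidence from other modalities, which this sketch leaves out.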