As advances in artificial intelligence enable increasingly capable learning-based autonomous agents, it becomes more challenging for human observers to efficiently construct a mental model of the agent's behaviour. In order to successfully deploy autonomous agents, humans should not only be able to understand the individual limitations of the agents but also have insight on how they compare against one another. To do so, we need effective methods for generating human interpretable agent behaviour summaries. Single agent behaviour summarization has been tackled in the past through methods that generate explanations for why an agent chose to pick a particular action at a single timestep. However, for complex tasks, a per-action explanation may not be able to convey an agents global strategy. As a result, researchers have looked towards multi-timestep summaries which can better help humans assess an agents overall capability. More recently, multi-step summaries have also been used for generating contrasting examples to evaluate multiple agents. However, past approaches have largely relied on unstructured search methods to generate summaries and require agents to have a discrete action space. In this paper we present an adaptive search method for efficiently generating contrasting behaviour summaries with support for continuous state and action spaces. We perform a user study to evaluate the effectiveness of the summaries for helping humans discern the superior autonomous agent for a given task. Our results indicate that adaptive search can efficiently identify informative contrasting scenarios that enable humans to accurately select the better performing agent with a limited observation time budget.
翻译:随着人工智能技术的进步,基于学习的自主智能体越来越具备更强大的能力,人类观察者越来越难以有效地构建智能体的行为心理模型。为了成功地部署自主智能体,人类既需要理解智能体的各项限制,也需要知道它们之间的比较。为此,我们需要有效的方法来生成人类可解释的智能体行为摘要。过去单一智能体行为摘要的研究是通过方法来生成智能体在单个时间步上选择特定动作的解释。然而,在复杂任务中,单个动作的解释可能无法传达智能体的全局策略。因此,研究人员已经开始关注多时间步摘要,这可以更好地帮助人类评估智能体的整体能力。最近,多步摘要也被用于生成对比示例以评估多个智能体。 然而,过去的方法很大程度上依赖非结构化的搜索方法来生成摘要,并且需要智能体具有离散的行动空间。在本文中,我们提出了一种适应性搜索方法,用于高效生成具有支持连续状态和动作空间的对比行为总结。我们进行了一项用户研究,以评估总结的有效性,以帮助人类确定特定任务的优胜自主智能体。我们的结果表明,适应性搜索可以高效地识别信息丰富的对比情况,使人类能够在有限的观察时间预算内准确选择表现更好的智能体。