A desirable goal for autonomous agents is to be able to coordinate on the fly with previously unknown teammates. Known as "ad hoc teamwork", enabling such a capability has been receiving increasing attention in the research community. One of the central challenges in ad hoc teamwork is quickly recognizing the current plans of other agents and planning accordingly. In this paper, we focus on the scenario in which teammates can communicate with one another, but only at a cost. Thus, they must carefully balance plan recognition based on observations vs. that based on communication. This paper proposes a new metric for evaluating how similar are two policies that a teammate may be following - the Expected Divergence Point (EDP). We then present a novel planning algorithm for ad hoc teamwork, determining which query to ask and planning accordingly. We demonstrate the effectiveness of this algorithm in a range of increasingly general communication in ad hoc teamwork problems.
翻译:自主代理人的一个理想目标是能够与以前未知的队友协调飞行。 被称为“临时团队合作”,使这种能力在研究界受到越来越多的关注。 临时团队合作的主要挑战之一是迅速认识其他代理人目前的计划并据此进行规划。 在本文件中,我们侧重于队友能够相互交流但只收费的情景。 因此,他们必须谨慎地平衡基于观察和基于交流的意见对计划的承认。本文件提出了一个新的衡量标准,用以评估一个队友可能遵循的两种政策——预期差异点(EDP)——的相似性。 然后我们为特别团队合作提出一个新的规划算法,确定哪些问题可以查询和相应规划。我们通过在特殊团队合作问题上日益普遍的沟通,展示了这种算法的有效性。