Existing approaches to coalition formation often assume that requirements associated with tasks are precisely specified by the human operator. However, prior work has demonstrated that humans, while extremely adept at solving complex problems, struggle to explicitly state their solution strategy. Further, existing approaches often ignore the fact that experts may utilize different, but equally-valid, solutions (i.e., heterogeneous strategies) to the same problem. In this work, we propose a two-part framework to address these challenges. First, we tackle the challenge of inferring implicit strategies directly from expert demonstrations of coalition formation. To this end, we model and infer such heterogeneous strategies as capability-based requirements associated with each task. Next, we propose a method capable of adaptively selecting one of the inferred strategies that best suits the target team without requiring additional training. Specifically, we formulate and solve a constrained optimization problem that simultaneously selects the most appropriate strategy given the target team's capabilities, and allocates its constituents into appropriate coalitions. We evaluate our approach against several baselines, including some that resemble existing approaches, using detailed numerical simulations, StarCraft II battles, and a multi-robot emergency-response scenario. Our results indicate that our framework consistently outperforms all baselines in terms of requirement satisfaction, resource utilization, and task success rates.
翻译:然而,先前的工作表明,人类虽然非常擅长解决复杂问题,但努力明确地阐明其解决方案战略。此外,现有方法往往忽视专家对同一问题可能使用不同但同样有效的解决方案(即多样化战略)这一事实。在这项工作中,我们提出一个由两部分组成的框架来应对这些挑战。首先,我们应对直接从建立联盟的专家示范中推断出隐含战略的挑战。为此目的,我们模拟并推断出不同战略,例如与每项任务相关的基于能力的要求。接下来,我们提出一种方法,能够适应地选择最适合目标小组的推论战略之一,而无需额外培训。具体地说,我们制定并解决一个限制最优化的问题,根据目标小组的能力同时选择最适当的战略,并将其成员分配到适当的联盟中。我们根据若干基线,包括一些与现有方法相似的基线,利用详细的数字模拟,StarCraft II战斗,以及一个多机器人应急设想中的各种要求。我们的资源利用率基准,表明我们所有的资源使用率基准,以及各种满足率。