We introduce a model for multi-agent interaction problems to understand how a heterogeneous team of agents should organize its resources to tackle a heterogeneous team of attackers. This model is inspired by how the human immune system tackles a diverse set of pathogens. The key property of this model is "cross-reactivity" which enables a particular defender type to respond strongly to some attackers but weakly to a few different types of attackers. Due to this, the optimal defender distribution that minimizes the harm incurred by attackers is supported on a discrete set. This allows the defender team to allocate resources to a few types and yet tackle a large number of attacker types. We study this model in different settings to characterize a set of guiding principles for control problems with heterogeneous teams of agents, e.g., sensitivity of the harm to sub-optimal defender distributions, teams consisting of a small number of attackers and defenders, estimating and tackling an evolving attacker distribution, and competition between defenders that gives near-optimal behavior using decentralized computation of the control. We also compare this model with reinforcement-learned policies for the defender team.
翻译:我们引入了一个多智能体交互问题的模型,以此来了解一个异构团队应该如何组织资源来对抗一支异构攻击队伍。这个模型受到人类免疫系统如何应对不同病原体的启发。这个模型的关键特性是“交叉反应性”,使得特定的防御者类型对一些攻击者做出强烈反应,但对另外几种攻击者只做出弱反应。由于这一点,最小化受到攻击者伤害的最优防御者分布支持离散集合。这使得防御者团队能够为少数类型分配资源,却能应对数量众多的攻击者类型。我们在不同的设置中研究了这个模型,以表征关于具有异构团队的控制问题的一系列指导原则,例如伤害对次优防御者分布的敏感性,由少数攻击者和防御者组成的团队,估计和对抗不断变化的攻击者分布,以及竞争防御者之间使用去中心化计算控制实现近似最优行为等。我们还将这个模型与强化学习的防御者策略进行了比较。