The vulnerability of machine learning models to adversarial attacks has been attracting considerable attention in recent years. Most existing studies focus on the behavior of stand-alone single-agent learners. In comparison, this work studies adversarial training over graphs, where individual agents are subjected to perturbations of varied strength levels across space. It is expected that interactions by linked agents, and the heterogeneity of the attack models that are possible over the graph, can help enhance robustness in view of the coordination power of the group. Using a min-max formulation of diffusion learning, we develop a decentralized adversarial training framework for multi-agent systems. We analyze the convergence properties of the proposed scheme for both convex and non-convex environments, and illustrate the enhanced robustness to adversarial attacks.
翻译:机器学习模型对对抗性攻击的易受攻击性最近引起了相当大的关注。大多数现有的研究聚焦于独立单一代理人学习器的行为。相比之下,本文研究图形上的对抗性训练,其中单个代理人在空间上受到不同强度水平的扰动。预期通过链接代理人和可能在图形上的攻击模型的异质性,可以在团队的协调力的视角下增强鲁棒性。我们使用扩散学习的极小极大公式,为多智能体系统开发一种分散的对抗性训练框架。我们分析了该方案在凸和非凸环境下的收敛特性,并证明了增强的抵御对抗性攻击能力。