Recently, many cooperative distributed multi-agent reinforcement learning (MARL) algorithms have been proposed in the literature. In this work, we study the effect of adversarial attacks on a network that employs a consensus-based MARL algorithm. We show that an adversarial agent can persuade all the other agents in the network to implement policies that optimize an objective that it desires. In this sense, the standard consensus-based MARL algorithms are fragile to attacks.
翻译:最近,文献中提出了许多合作分布式多试剂强化学习算法(MARL)建议。在这项工作中,我们研究了对使用协商一致的MARL算法的网络进行对抗性攻击的影响。我们表明,对抗性代理人可以说服网络中所有其他代理商执行优化目标的政策。从这个意义上讲,标准的协商一致MARL算法很容易受到攻击。