争取在多机构系统中实现真正无损失、无损失和无失失的简简简通信 (Towards True Lossless Sparse Communication in Multi-Agent Systems)

Communication enables agents to cooperate to achieve their goals. Learning when to communicate, i.e., sparse (in time) communication, and whom to message is particularly important when bandwidth is limited. Recent work in learning sparse individualized communication, however, suffers from high variance during training, where decreasing communication comes at the cost of decreased reward, particularly in cooperative tasks. We use the information bottleneck to reframe sparsity as a representation learning problem, which we show naturally enables lossless sparse communication at lower budgets than prior art. In this paper, we propose a method for true lossless sparsity in communication via Information Maximizing Gated Sparse Multi-Agent Communication (IMGS-MAC). Our model uses two individualized regularization objectives, an information maximization autoencoder and sparse communication loss, to create informative and sparse communication. We evaluate the learned communication `language' through direct causal analysis of messages in non-sparse runs to determine the range of lossless sparse budgets, which allow zero-shot sparsity, and the range of sparse budgets that will inquire a reward loss, which is minimized by our learned gating function with few-shot sparsity. To demonstrate the efficacy of our results, we experiment in cooperative multi-agent tasks where communication is essential for success. We evaluate our model with both continuous and discrete messages. We focus our analysis on a variety of ablations to show the effect of message representations, including their properties, and lossless performance of our model.

翻译：通信使代理商能够合作实现其目标。学习何时沟通,即(在时间上)通信稀少,在带宽有限的情况下,谁对信息特别重要。最近学习个人化通信的工作在培训期间差异很大,但培训期间通信减少的代价是报酬减少,特别是在合作任务方面。我们利用信息瓶颈重新定义宽度作为代表学习问题,我们显示,这自然使得在预算比以往低的情况下能够进行无损失的分散通信。在本文件中,我们提出一种方法,通过信息最大化的分散多发多发通信(IMGS-MAC),使通信真正无损无损的松散。我们的模式使用两种个化的规范化目标,即信息最大化自动编码和零散的通信损失,来创造信息性和分散的沟通。我们通过直接因果性分析非扭曲的信息来评估所学的“语言”通信,以确定损失无损失的预算的范围,从而允许零光度的松懈的预算,以及大量预算将获得奖励的损失范围缩小,我们通过我们所学的模型功能,用微的微缩多发式多发的图像来尽量减少。我们用合作性的信息分析显示我们的工作效率,我们的合作性分析,我们如何评价我们如何展示我们的工作成果,包括合作性分析。