{We investigate the communications design in a multiagent system (MAS) in which agents cooperate to maximize the averaged sum of discounted one-stage rewards of a collaborative task. Due to the limited communication rate between the agents, each agent should efficiently represent its local observation and communicate an abstract version of the observations to improve the collaborative task performance. We first show that this problem is equivalent to a form of rate-distortion problem which we call task-based information compression (TBIC). We then introduce the state-aggregation for information compression algorithm (SAIC) to solve the formulated TBIC problem. It is shown that SAIC is able to achieve near-optimal performance in terms of the achieved sum of discounted rewards. The proposed algorithm is applied to a rendezvous problem and its performance is compared with several benchmarks. Numerical experiments confirm the superiority of the proposed algorithm.
翻译:{我们调查多试剂系统中的通信设计,在多试剂系统中,代理商进行合作,最大限度地提高合作任务的折扣单阶段奖励的平均和值。由于代理商之间的通信率有限,每个代理商应高效地代表其当地观测,并传递一份抽象的观测结果,以改进协作任务绩效。我们首先表明,这一问题相当于一种按费率扭曲的问题,我们称之为基于任务的信息压缩(TBIC ) 。然后我们引入信息压缩算法(SAIC ) 的国家汇总,以解决已拟订的TBIC 问题。事实证明,SAIC 能够以已实现的折扣报酬总和达到接近最佳的性能。提议的算法适用于汇合问题,其性能与几个基准进行比较。数字实验证实了拟议算法的优越性。