Various applications for inter-machine communications are on the rise. Whether it is for autonomous driving vehicles or the internet of everything, machines are more connected than ever to improve their performance in fulfilling a given task. While in traditional communications the goal has often been to reconstruct the underlying message, under the emerging task-oriented paradigm, the goal of communication is to enable the receiving end to make more informed decisions or more precise estimates/computations. Motivated by these recent developments, in this paper, we perform an indirect design of the communications in a multi-agent system (MAS) in which agents cooperate to maximize the averaged sum of discounted one-stage rewards of a collaborative task. Due to the bit-budgeted communications between the agents, each agent should efficiently represent its local observation and communicate an abstracted version of the observations to improve the collaborative task performance. We first show that this problem can be approximated as a form of data-quantization problem which we call task-oriented data compression (TODC). We then introduce the state-aggregation for information compression algorithm (SAIC) to solve the formulated TODC problem. It is shown that SAIC is able to achieve near-optimal performance in terms of the achieved sum of discounted rewards. The proposed algorithm is applied to a geometric consensus problem and its performance is compared with several benchmarks. Numerical experiments confirm the promise of this indirect design approach for task-oriented multi-agent communications.
翻译:各种机器间通信应用正在上升。无论是针对自主驾驶车辆还是各种互联网,机器都比以往更紧密地连接起来,以提高其履行特定任务的业绩。在传统通信中,目标往往是在新兴的任务导向范式下重建基本信息,而通信的目标是使接收方能够作出更知情的决定或作出更精确的估计/计算。在本文件中,我们受这些最新动态的驱动,在一个多试剂系统中间接设计通信。在这个系统中,代理商进行合作,以最大限度地增加合作任务的折扣一阶段奖励的平均总和。由于代理商之间有部分预算的通信,每个代理商应高效地代表其当地观测,并传播观察的抽象版本,以改进协作任务绩效。我们首先显示,这一问题可以被近似为数据量化问题的一种形式,我们称之为任务导向数据压缩(TODC)。然后我们引入了信息压缩算法(SAIC)的国家汇总方法,以便解决已拟订的TODC问题。它表明,由于各代理商之间的通信平均金额,SAIC能够有效地代表其当地观测,并传递一份抽象的观测结果。我们首先表明,它所提出的数字级的计算结果是,其设计中的一种接近地算法的计算结果。