In this paper, we consider a discrete-time multi-agent system involving $N$ cost-coupled networked rational agents solving a consensus problem and a central Base Station (BS), scheduling agent communications over a network. Due to a hard bandwidth constraint on the number of transmissions through the network, at most $R_d < N$ agents can concurrently access their state information through the network. Under standard assumptions on the information structure of the agents and the BS, we first show that the control actions of the agents are free of any dual effect, allowing for separation between estimation and control problems at each agent. Next, we propose a weighted age of information (WAoI) metric for the scheduling problem of the BS, where the weights depend on the estimation error of the agents. The BS aims to find the optimum scheduling policy that minimizes the WAoI, subject to the hard bandwidth constraint. Since this problem is NP hard, we first relax the hard constraint to a soft update rate constraint, and then compute an optimal policy for the relaxed problem by reformulating it into a Markov Decision Process (MDP). This then inspires a sub-optimal policy for the bandwidth constrained problem, which is shown to approach the optimal policy as $N \rightarrow \infty$. Next, we solve the consensus problem using the mean-field game framework wherein we first design decentralized control policies for a limiting case of the $N$-agent system (as $N \rightarrow \infty$). By explicitly constructing the mean-field system, we prove the existence and uniqueness of the mean-field equilibrium. Consequently, we show that the obtained equilibrium policies constitute an $\epsilon$-Nash equilibrium for the finite agent system. Finally, we validate the performance of both the scheduling and the control policies through numerical simulations.
翻译:在本文中, 我们考虑一个离散时间的多试剂系统, 涉及由成本相联的网络理性代理器, 解决一个共识问题和中央基地站( BS), 将代理器通信安排在网络上。 由于对网络传输次数的硬带宽限制, 最多为 $d < N$ 代理器可以同时通过网络访问它们的国家信息。 根据关于代理商和BS信息结构的标准假设, 我们首先显示代理商的控制动作没有任何双重效果, 允许将每个代理商的估算和控制问题区分开来。 其次, 我们建议对 BS 的调度问题采用加权信息年龄( WAOI), 其重量取决于代理商的估计错误。 BS 旨在找到最佳的时间安排政策, 将WAOI 限制到 硬带宽度 。 由于这个问题是硬的, 我们首先将硬性约束硬性更新利率限制硬性政策, 然后通过我们重新配置Markov 决策程序( MDP ), 以加权的硬性政策 显示我们最优性的系统 。