任意-Size线性网络深层结构小组:分散估计、最佳控制和分离原则 (Deep Structured Teams in Arbitrary-Size Linear Networks: Decentralized Estimation, Optimal Control and Separation Principle)

In this article, we introduce decentralized Kalman filters for linear quadratic deep structured teams. The agents in deep structured teams are coupled in dynamics, costs and measurements through a set of linear regressions of the states and actions (also called deep states and deep actions). The information structure is decentralized, where every agent observes a noisy measurement of its local state and the global deep state. Since the number of agents is often very large in deep structured teams, any naive approach to finding an optimal Kalman filter suffers from the curse of dimensionality. Moreover, due to the decentralized nature of information structure, the resultant optimization problem is non-convex, in general, where non-linear strategies can outperform linear ones. However, we prove that the optimal strategy is linear in the local state estimate as well as the deep state estimate and can be efficiently computed by two scale-free Riccati equations and Kalman filters. We propose a bi-level orthogonal approach across both space and time levels based on a gauge transformation technique to achieve the above result. We also establish a separation principle between optimal control and optimal estimation. Furthermore, we show that as the number of agents goes to infinity, the Kalman gain associated with the deep state estimate converges to zero at a rate inversely proportional to the number of agents. This leads to a fully decentralized approximate strategy where every agent predicts the deep state by its conditional and unconditional expected value, also known as the certainty equivalence approximation and (weighted) mean-field approximation, respectively.

翻译：在本篇文章中,我们为线形深层结构化团队引入了分散的卡尔曼过滤器。深层结构化团队中的代理商通过一系列州和行动(也称为深层状态和深层行动)的线性回归,在动态、成本和测量方面相互结合。信息结构是分散的,每个代理商都观察到对当地状况和全球深层状态进行噪音的测量。由于深层结构化团队中的代理商数量通常非常多,因此找到最佳卡尔曼过滤器的任何天真的方法都受到维度的诅咒。此外,由于信息结构的分散性质,由此产生的优化问题一般是非康韦克斯,非线性战略可以超过线性战略。然而,我们证明最佳战略是本地州估算的线性以及深度状态估计的线性,可以通过两个无规模的里卡提方程方程式和卡尔曼过滤器进行高效的计算。我们提议在空间和时间层次上采用双级或级的双级方法,实现上述结果。我们还确定了最佳控制与优化估算之间的原则。此外,在非线性战略中,我们证明最佳战略的线性战略是直线性战略的线性线性线性,也就是性,并且表明,每一级代理商的精度的精度的精度的精度水平水平值将最终的精度水平的精确率率率率率值分别推到最接近值。我们所理解。