通过分解和等级近似化对线性多数据系统进行无模型最佳最佳控制 (Model-Free Optimal Control of Linear Multi-Agent Systems via Decomposition and Hierarchical Approximation)

Designing the optimal linear quadratic regulator (LQR) for a large-scale multi-agent system (MAS) is time-consuming since it involves solving a large-size matrix Riccati equation. The situation is further exasperated when the design needs to be done in a model-free way using schemes such as reinforcement learning (RL). To reduce this computational complexity, we decompose the large-scale LQR design problem into multiple smaller-size LQR design problems. We consider the objective function to be specified over an undirected graph, and cast the decomposition as a graph clustering problem. The graph is decomposed into two parts, one consisting of independent clusters of connected components, and the other containing edges that connect different clusters. Accordingly, the resulting controller has a hierarchical structure, consisting of two components. The first component optimizes the performance of each independent cluster by solving the smaller-size LQR design problem in a model-free way using an RL algorithm. The second component accounts for the objective coupling different clusters, which is achieved by solving a least squares problem in one shot. Although suboptimal, the hierarchical controller adheres to a particular structure as specified by inter-agent couplings in the objective function and by the decomposition strategy. Mathematical formulations are established to find a decomposition that minimizes the number of required communication links or reduces the optimality gap. Numerical simulations are provided to highlight the pros and cons of the proposed designs.

翻译：设计大型多试剂系统的最佳线性二次调节器(LQR)是耗时的,因为它涉及解决大型矩阵的大型矩阵里卡蒂方程式。当设计需要使用强化学习(RL)等方法来以无模型的方式完成时,这种情况会进一步激化。为了降低这一计算复杂性,我们将大型LQR设计问题分解成多个规模较小的LQR设计问题。我们认为目标函数是用一个无方向的图形来指定,并将分解成一个图形组合问题。图表分解成两个部分,其中一个由独立的连接不同组群的组合组成,而另一个部分则包含连接不同组群的边缘。因此,由此产生的控制器有一个等级结构,由两个部分组成。为优化每个独立组群的性能,通过一个无模型来解决规模较小的LQRQR设计问题。目标组合的第二个组成部分显示不同组群的截分,通过在一个镜头中解决一个最小的正方形问题来完成。尽管一个由独立组合组合的组合组合组合组合组成结构中,通过一个最优化的正态结构来降低一个最小的正态结构结构结构,但以最下级控制器的螺旋函数将一个最优化的组合到一个最下方形结构,通过一个最优化的正向一个正向一个正向一个正态的组合式的组合式结构结构,通过固定的递制成为最小化的螺旋后制成。