In this work, we propose a novel safe and scalable decentralized solution for multi-agent control in the presence of stochastic disturbances. Safety is mathematically encoded using stochastic control barrier functions and safe controls are computed by solving quadratic programs. Decentralization is achieved by augmenting to each agent's optimization variables, copy variables, for its neighbors. This allows us to decouple the centralized multi-agent optimization problem. However, to ensure safety, neighboring agents must agree on "what is safe for both of us" and this creates a need for consensus. To enable safe consensus solutions, we incorporate an ADMM-based approach. Specifically, we propose a Merged CADMM-OSQP implicit neural network layer, that solves a mini-batch of both, local quadratic programs as well as the overall consensus problem, as a single optimization problem. This layer is embedded within a Deep FBSDEs network architecture at every time step, to facilitate end-to-end differentiable, safe and decentralized stochastic optimal control. The efficacy of the proposed approach is demonstrated on several challenging multi-robot tasks in simulation. By imposing requirements on safety specified by collision avoidance constraints, the safe operation of all agents is ensured during the entire training process. We also demonstrate superior scalability in terms of computational and memory savings as compared to a centralized approach.
翻译:在这项工作中,我们提出一个新的安全且可扩展的多试剂控制解决方案。安全是数学编码,使用随机控制屏障功能进行数学编码,安全控制通过解决二次程序进行计算。权力下放是通过扩大每个代理的优化变数、复制变量来实现的,这使我们能够将集中的多试剂优化问题分离出来。然而,为了确保安全,邻接代理商必须商定“对我们俩都安全什么”并由此产生共识的必要性。为了实现安全共识解决方案,我们采用了基于ADMM的ADM 方法。具体地说,我们建议采用合并的 CADMM-OSQP 隐含的神经网络层,作为单一优化问题,通过扩大每个代理商的优化变量变量和整体共识问题来解决两者的微型组合。这个层被嵌入到一个深FBSDEs网络架构中,以促进最终到最终的、安全的、分散的和分散的随机的优化控制。拟议方法的功效体现在若干具有挑战性的多核MMMM-OS隐含的神经网络层。在模拟过程中,通过安全性的安全性安全性、高压的升级的操作,也通过模拟的方式,将安全性的安全性的安全性安全性安全性安全性安全性安全性管理要求展示。