未知线性系统学习稳定适应性明确区别的预测控制 (Learning Stable Adaptive Explicit Differentiable Predictive Control for Unknown Linear Systems)

We present differentiable predictive control (DPC), a method for learning constrained adaptive neural control policies and dynamical models of unknown linear systems. DPC presents an approximate data-driven solution approach to the explicit Model Predictive Control (MPC) problem as a scalable alternative to computationally expensive multiparametric programming solvers. DPC is formulated as a constrained deep learning problem whose architecture is inspired by the structure of classical MPC. The optimization of the neural control policy is based on automatic differentiation of the MPC-inspired loss function through a differentiable closed-loop system model. This novel solution approach can optimize adaptive neural control policies for time-varying references while obeying state and input constraints without the prior need of an MPC controller. We show that DPC can learn to stabilize constrained neural control policies for systems with unstable dynamics. Moreover, we provide sufficient conditions for asymptotic stability of generic closed-loop system dynamics with neural feedback policies. In simulation case studies, we assess the performance of the proposed DPC method in terms of reference tracking, robustness, and computational and memory footprints compared against classical model-based and data-driven control approaches. We demonstrate that DPC scales linearly with problem size, compared to exponential scalability of classical explicit MPC based on multiparametric programming.

翻译：我们提出了不同的预测控制(DPC),这是学习受限制的适应性神经控制政策和未知线性系统动态模型的一种方法。DPC对明显的模型预测控制(MPC)问题提出了一种由数据驱动的大致解决办法,作为计算昂贵的多参数编程求解器的一种可伸缩的替代方案。DPC的制定是一个有限的深层次学习问题,其结构受到古典MPC结构的启发。神经控制政策的优化是基于通过可区分的封闭环流系统模式自动区分MPC所激发的损失功能。这种新颖的解决办法可以优化时间变化参考参考的适应性神经控制政策,同时在不事先需要MPC控制器的情况下服从状态和输入限制。我们表明DPC可以学习稳定不稳定动态系统受限制的神经控制政策。此外,我们为通用封闭式系统动态动态不受约束地稳定提供了充分的条件,并制定了神经反馈政策。在模拟案例研究中,我们评估了拟议的DPC方法在参考跟踪、稳健性、计算和存储性时间足迹方面的绩效政策,与基于经典模型的模型和指数性模型的多尺度相比,我们展示了以可比较的可动性模型和数据驱动控制。