Deep Koopman Operator with Control for Nonlinear Systems

Recently, the Koopman operator has become a promising data-driven tool for real-time control of unknown nonlinear systems. It maps a nonlinear system into an equivalent linear system in an embedding space, where real-time linear control methods can be applied. However, designing an appropriate Koopman embedding function remains a challenging task. Furthermore, most Koopman-based algorithms only consider nonlinear systems in which the control input enters linearly, resulting in poor prediction and control performance when the system is also nonlinear in the control input. In this work, we propose an end-to-end deep learning framework that learns the Koopman embedding function and the Koopman operator together to alleviate these difficulties. We first parameterize the embedding function and the Koopman operator with neural networks and train them end-to-end with a K-step loss function. Then, an auxiliary control network is added to encode the nonlinear state-dependent control term, modeling the nonlinearity in the control input. This encoded term is treated as the new control variable, which ensures that the modeled system is linear in the embedding space. We next deploy a Linear Quadratic Regulator (LQR) in the linear embedding space to derive the optimal control policy and decode the actual control input from the control network. Experimental results demonstrate that our approach outperforms existing methods, reducing the prediction error by an order of magnitude and achieving superior control performance on several nonlinear dynamical systems, including a damped pendulum, CartPole, and a seven-DOF robotic manipulator.
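To make the structure described above concrete, the following is a minimal sketch, not the authors' implementation. It uses a hand-picked toy system, a hand-picked embedding, and a closed-form control encoding in place of the learned networks; every name here (`step`, `embed`, `encode_control`, `decode_control`, `k_step_loss`) and the toy dynamics themselves are assumptions made for illustration. The point is only to show how an encoded control variable v = g(x, u) can keep the lifted model linear, and how a K-step loss compares a linear rollout against the embedded true trajectory.

```python
import math

# Toy dynamics (an assumption for illustration, not a system from the paper):
#   x1' = a * x1
#   x2' = b * x2 + c * x1**2 + sin(u)     <- nonlinear in the control input u
A_COEF, B_COEF, C_COEF = 0.9, 0.5, 0.3

def step(x, u):
    """One step of the true nonlinear system."""
    x1, x2 = x
    return (A_COEF * x1, B_COEF * x2 + C_COEF * x1 ** 2 + math.sin(u))

def embed(x):
    """Hand-picked embedding z = [x1, x2, x1^2]; a learned network would replace this."""
    x1, x2 = x
    return [x1, x2, x1 ** 2]

def encode_control(x, u):
    """Stand-in for the auxiliary control network: v = g(x, u)."""
    return math.sin(u)

def decode_control(x, v):
    """Invert g for this toy case to recover the actual input u from v."""
    return math.asin(max(-1.0, min(1.0, v)))

# Linear model in the embedding space: z' = A z + B v.
A = [[A_COEF, 0.0, 0.0],
     [0.0, B_COEF, C_COEF],
     [0.0, 0.0, A_COEF ** 2]]
B = [0.0, 1.0, 0.0]

def linear_step(z, v):
    """Advance the lifted state one step with the linear model."""
    return [sum(A[i][j] * z[j] for j in range(3)) + B[i] * v for i in range(3)]

def k_step_loss(x0, controls):
    """Mean squared error between a K-step linear rollout and the embedded true states."""
    x, z = x0, embed(x0)
    total = 0.0
    for u in controls:
        z = linear_step(z, encode_control(x, u))  # predicted lifted state
        x = step(x, u)                            # true next state
        total += sum((zi - ti) ** 2 for zi, ti in zip(z, embed(x)))
    return total / len(controls)

loss = k_step_loss((1.0, -0.5), [0.2, -0.4, 0.7, 0.1])
```

For this toy system the embedding is exact, so the K-step loss is zero up to floating-point error. In the learned setting, the embedding, the matrices A and B, and the control encoding would instead be trained to drive this loss down. Because the lifted model is linear in (z, v), a linear controller such as LQR can be designed for v, and the actual input u is then recovered through the decoding step.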
