With the development of end-to-end control based on deep learning, it is important to study new system modeling techniques to realize dynamics modeling with high-dimensional inputs. In this paper, a novel Koopman-based deep convolutional network, called CKNet, is proposed to identify latent dynamics from raw pixels. CKNet learns an encoder and decoder to play the role of the Koopman eigenfunctions and modes, respectively. The Koopman eigenvalues can be approximated by eigenvalues of the learned state transition matrix. The deterministic convolutional Koopman network (DCKNet) and the variational convolutional Koopman network (VCKNet) are proposed to span some subspace for approximating the Koopman operator respectively. Because CKNet is trained under the constraints of the Koopman theory, the identified latent dynamics is in a linear form and has good interpretability. Besides, the state transition and control matrices are trained as trainable tensors so that the identified dynamics is also time-invariant. We also design an auxiliary weight term for reducing multi-step linearity and prediction losses. Experiments were conducted on two offline trained and four online trained nonlinear forced dynamical systems with continuous action spaces in Gym and Mujoco environment respectively, and the results show that identified dynamics are adequate for approximating the latent dynamics and generating clear images. Especially for offline trained cases, this work confirms CKNet from a novel perspective that we visualize the evolutionary processes of the latent states and the Koopman eigenfunctions with DCKNet and VCKNet separately to each task based on the same episode and results demonstrate that different approaches learn similar features in shapes.
翻译:随着基于深层学习的端到端控制开发,必须研究新的系统模型技术,以高维投入实现动态模型模型。 在本文中, 提议建立一个名为 CKNet 的基于 Koopman 的新型 Koopman 深共变网络, 以分别从原始像素中找出潜在的动态。 CKNet 学习了一种编码器和解码器, 以发挥Koopman 电子元件和模式的作用。 Koopman 图像值可以通过所学的州级过渡矩阵的精度值来近似。 确定性共振动 Koopman 网络( DC KNet) 和变动共振网络网络( VC KNet 网络 ), 并提议一个名为 CKOopman 操作器操作器的子空间。 由于CKNet 在Koopman 理论的制约下, 所查明的潜伏动力动力动力动力动力学是线的直径直径直径直径直径直径直径直径直径直的。 此外, 州和控的变动阵列阵列的阵列和控控基阵列的阵列的阵列的阵列的阵列的阵列和控变动的阵列的阵列的阵列的阵列, 和控后, 也分别设计了一个连续动的阵列的模拟的连续动的阵列的连续动的阵列的阵列的阵列的阵列的阵列的阵列的阵列的K。