Distilling analytical models from data has the potential to advance our understanding and prediction of nonlinear dynamics. Although discovery of governing equations based on observed system states (e.g., trajectory time series) has revealed success in a wide range of nonlinear dynamics, uncovering the closed-form equations directly from raw videos still remains an open challenge. To this end, we introduce a novel end-to-end unsupervised deep learning framework to uncover the mathematical structure of equations that governs the dynamics of moving objects in videos. Such an architecture consists of (1) an encoder-decoder network that learns low-dimensional spatial/pixel coordinates of the moving object, (2) a learnable Spatial-Physical Transformation component that creates mapping between the extracted spatial/pixel coordinates and the latent physical states of dynamics, and (3) a numerical integrator-based sparse regression module that uncovers the parsimonious closed-form governing equations of learned physical states and, meanwhile, serves as a constraint to the autoencoder. The efficacy of the proposed method is demonstrated by uncovering the governing equations of a variety of nonlinear dynamical systems depicted by moving objects in videos. The resulting computational framework enables discovery of parsimonious interpretable model in a flexible and accessible sensing environment where only videos are available.
翻译:从数据中蒸馏的分析模型有可能促进我们对非线性动态的了解和预测。虽然根据观察到的系统状态(例如,轨迹时间序列)发现治理方程式,显示在广泛的非线性动态中取得了成功,但从原始视频直接发现封闭式方程式仍是一个开放的挑战。为此,我们引入了一个创新的端到端的深层学习框架,以发现制约视频中移动对象动态的方程式的数学结构。这种结构包括:(1)一个以观察的系统状态(例如,轨迹时间序列)为基础,发现治理方程式网络,学习移动对象的低度空间/像素坐标;(2)一个可学习的空间-物理转变部分,在提取的空间/像素坐标和动态的潜在物理状态之间绘制地图;(3)一个基于数字的细微缩的回归模块,揭示了管理已学物理状态中移动物体动态动态的偏移式封闭式方程式,同时作为自动编码的制约。通过在可移动式图像模型中发现可移动的可移动式图像框架来展示拟议方法的功效。