Distilling interpretable physical laws from videos has led to expanded interest in the computer vision community recently thanks to the advances in deep learning, but still remains a great challenge. This paper introduces an end-to-end unsupervised deep learning framework to uncover the explicit governing equations of dynamics presented by moving object(s), based on recorded videos. Instead in the pixel (spatial) coordinate system of image space, the physical law is modeled in a regressed underlying physical coordinate system where the physical states follow potential explicit governing equations. A numerical integrator-based sparse regression module is designed and serves as a physical constraint to the autoencoder and coordinate system regression, and, in the meanwhile, uncover the parsimonious closed-form governing equations from the learned physical states. Experiments on simulated dynamical scenes show that the proposed method is able to distill closed-form governing equations and simultaneously identify unknown excitation input for several dynamical systems recorded by videos, which fills in the gap in literature where no existing methods are available and applicable for solving this type of problem.
翻译:由于深层学习的进步,从视频中蒸馏可解释的物理法则最近引起了对计算机视觉界的兴趣,这导致对计算机视觉界的兴趣增加,但仍是一个巨大的挑战。本文件引入了一个端到端的、不受监督的深层次学习框架,以根据录制的视频揭示移动对象所呈现的动态的清晰的治理方程式。在图像空间像素(空间)协调系统中,物理法则建模于一个倒退的物理协调系统,物理状态遵循着潜在的清晰治理方程式。一个基于数字集成器的稀薄回归模块被设计成并起到对自动编码器和协调系统回归的物理制约作用,同时从学习的物理状态中揭示出关于动态方程式的模糊的封闭方程式。模拟动态场实验显示,拟议的方法能够蒸发封闭式的方程式管理方程式,同时为通过视频记录的若干动态系统找出未知的引力投入,这些系统填补了文献中的空白,而目前没有可用的方法,可用于解决这类类型的问题。