A recently proposed class of models attempts to learn latent dynamics from high-dimensional observations, like images, using priors informed by Hamiltonian mechanics. While these models have important potential applications in areas like robotics or autonomous driving, there is currently no good way to evaluate their performance: existing methods primarily rely on image reconstruction quality, which does not always reflect the quality of the learnt latent dynamics. In this work, we empirically highlight the problems with the existing measures and develop a set of new measures, including a binary indicator of whether the underlying Hamiltonian dynamics have been faithfully captured, which we call Symplecticity Metric or SyMetric. Our measures take advantage of the known properties of Hamiltonian dynamics and are more discriminative of the model's ability to capture the underlying dynamics than reconstruction error. Using SyMetric, we identify a set of architectural choices that significantly improve the performance of a previously proposed model for inferring latent dynamics from pixels, the Hamiltonian Generative Network (HGN). Unlike the original HGN, the new HGN++ is able to discover an interpretable phase space with physically meaningful latents on some datasets. Furthermore, it is stable for significantly longer rollouts on a diverse range of 13 datasets, producing rollouts of essentially infinite length both forward and backwards in time with no degradation in quality on a subset of the datasets.
翻译:最近提出的一组模型试图从高层次观测中学习潜伏动态,例如图像,利用汉密尔顿机械学的先期信息,从图像等高层次观测中学习潜伏动态。虽然这些模型在机器人或自主驾驶等领域具有重要的潜在应用潜力,但目前没有很好的方法来评估其性能:现有方法主要依赖图像重建质量,这并不总是反映所学到的潜伏动态的质量。在这项工作中,我们从经验上强调现有措施的问题,并制订一套新措施,包括一个二进制指标,说明基础的汉密尔顿动力是否得到忠实的捕捉,我们称之为“即时计量”或“SyMetetri”。我们的措施利用了已知的汉密尔顿动力特性,并且比重建错误更具有歧视性地运用模型捕捉基本动态动力的能力。我们利用SyMetricri,确定了一套建筑选择,大大改进了先前提出的模型从像素、汉密尔顿基因网(HGN)中推断潜值的潜值。与最初的HGNGN不同的是,新的HN+能够发现一个可解释的阶段空间,在某些数据集上具有有形的潜值。此外,在13级的滚动后期的滚动中,在极的滚动中,在极的滚动的滚动中是稳定地的滚动中,在极的滚动中,在13级的滚动中,在极的滚动中没有后退的滚动。