移动式RNN:带有空间时间变化动态的视频预测灵活模型 (MotionRNN: A Flexible Model for Video Prediction with Spacetime-Varying Motions)

This paper tackles video prediction from a new dimension of predicting spacetime-varying motions that are incessantly changing across both space and time. Prior methods mainly capture the temporal state transitions but overlook the complex spatiotemporal variations of the motion itself, making them difficult to adapt to ever-changing motions. We observe that physical world motions can be decomposed into transient variation and motion trend, while the latter can be regarded as the accumulation of previous motions. Thus, simultaneously capturing the transient variation and the motion trend is the key to make spacetime-varying motions more predictable. Based on these observations, we propose the MotionRNN framework, which can capture the complex variations within motions and adapt to spacetime-varying scenarios. MotionRNN has two main contributions. The first is that we design the MotionGRU unit, which can model the transient variation and motion trend in a unified way. The second is that we apply the MotionGRU to RNN-based predictive models and indicate a new flexible video prediction architecture with a Motion Highway that can significantly improve the ability to predict changeable motions and avoid motion vanishing for stacked multiple-layer predictive models. With high flexibility, this framework can adapt to a series of models for deterministic spatiotemporal prediction. Our MotionRNN can yield significant improvements on three challenging benchmarks for video prediction with spacetime-varying motions.

翻译：本文从预测时空变换动议的新层面处理视频预测,这种变化在时空和时间上不断变化。先前的方法主要捕捉时间性转变,但忽略了运动本身复杂的时空变异,使其难以适应不断变化的运动。我们观察到,自然世界运动可以分解成短暂变异和运动趋势,而后者可以被视为以往运动的积累。因此,同时捕捉瞬时变异和运动趋势是使时空变换运动更加可预测的关键。根据这些观察,我们提出了移动RNNN框架,该框架可以捕捉运动内部的复杂变异,并适应时空变变幻情景。运动NNNN有两个主要贡献。第一,我们设计运动GRU单元,可以以统一的方式模拟瞬变动和运动趋势。第二,我们将移动GRURU应用于基于移动变变变变变变变变的预测模型,并表明一个新的灵活视频预测结构,可以大大改善可变动运动的能力,并避免运动运动在时间性变变变变变变变变的情景上改变运动。高的预测模型可以稳定地调整。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/