Fitfit Vid: 在像素级视频预报中超配 (FitVid: Overfitting in Pixel-Level Video Prediction)

An agent that is capable of predicting what happens next can perform a variety of tasks through planning with no additional training. Furthermore, such an agent can internally represent the complex dynamics of the real-world and therefore can acquire a representation useful for a variety of visual perception tasks. This makes predicting the future frames of a video, conditioned on the observed past and potentially future actions, an interesting task which remains exceptionally challenging despite many recent advances. Existing video prediction models have shown promising results on simple narrow benchmarks but they generate low quality predictions on real-life datasets with more complicated dynamics or broader domain. There is a growing body of evidence that underfitting on the training data is one of the primary causes for the low quality predictions. In this paper, we argue that the inefficient use of parameters in the current video models is the main reason for underfitting. Therefore, we introduce a new architecture, named FitVid, which is capable of severe overfitting on the common benchmarks while having similar parameter count as the current state-of-the-art models. We analyze the consequences of overfitting, illustrating how it can produce unexpected outcomes such as generating high quality output by repeating the training data, and how it can be mitigated using existing image augmentation techniques. As a result, FitVid outperforms the current state-of-the-art models across four different video prediction benchmarks on four different metrics.

翻译：能够预测下一步会发生什么的代理商可以通过没有额外培训的规划来完成各种任务。此外,这样的代理商可以在内部代表真实世界的复杂动态,因此可以获得对各种视觉认知任务有用的代表。这可以预测视频的未来框架,以观察到的过去和潜在的未来行动为条件,尽管最近取得了许多进展,但这是一项令人感兴趣的任务,尽管最近取得了许多进展,仍然非常具有挑战性。现有的视频预测模型在简单狭窄的基准上展示了有希望的结果,但在具有更复杂动态或更广泛域的真实生活数据集上却产生低质量预测。越来越多的证据表明,培训数据不足是低质量预测的主要原因之一。在本文件中,我们争辩说,目前视频模型中参数的使用效率低下是造成不足的主要原因。因此,我们引入了一个新的结构,名为Fit Vid,它能够严重地超编于共同基准,同时将参数与当前的最新模型相匹配。我们分析了过度配置的后果,说明它如何产生出出出出出出出出高质量产出的意外结果,例如通过重复培训结果,在四个不同的模型上,如何通过调整现有模型来降低现有四种不同的模型。

相关内容

过拟合

关注 8

过拟合，在AI领域多指机器学习得到模型太过复杂，导致在训练集上表现很好，然而在测试集上却不尽人意。过拟合（over-fitting）也称为过学习，它的直观表现是算法在训练集上表现好，但在测试集上表现不好，泛化性能差。过拟合是在模型参数拟合过程中由于训练数据包含抽样误差，在训练时复杂的模型将抽样误差也进行了拟合导致的。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

贝叶斯网络在医疗的应用综述：过去，现在和未来 | A Comprehensive Scoping Review of Bayesian Networks in Healthcare: Past, Present and Future

专知会员服务

41+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日