Future frame prediction has been approached through two primary methods: autoregressive and non-autoregressive. Autoregressive methods rely on the Markov assumption and can achieve high accuracy in the early stages of prediction when errors are not yet accumulated. However, their performance tends to decline as the number of time steps increases. In contrast, non-autoregressive methods can achieve relatively high performance but lack correlation between predictions for each time step. In this paper, we propose an Implicit Stacked Autoregressive Model for Video Prediction (IAM4VP), which is an implicit video prediction model that applies a stacked autoregressive method. Like non-autoregressive methods, stacked autoregressive methods use the same observed frame to estimate all future frames. However, they use their own predictions as input, similar to autoregressive methods. As the number of time steps increases, predictions are sequentially stacked in the queue. To evaluate the effectiveness of IAM4VP, we conducted experiments on three common future frame prediction benchmark datasets and weather\&climate prediction benchmark datasets. The results demonstrate that our proposed model achieves state-of-the-art performance.
翻译:未来框架预测是通过两种主要方法实现的:自动递减和非自动递减。自动递减方法依赖于Markov假设,在尚未累积错误的情况下可以在预测的早期阶段达到高度准确性。然而,随着时间步骤的增加,其性能往往会下降。相反,非自动递减方法可以达到相对较高的性能,但每个时间步骤的预测之间缺乏相关性。在本文件中,我们提出了一个隐含的视频预测自动递增模型(IMA4VP),这是一个隐含的视频预测模型,采用堆叠的自动递增方法。像非递减方法一样,堆叠自动递减方法使用同样的观察框架来估计所有未来框架。然而,它们使用自己的预测作为投入,类似于自动递减方法。随着时间步骤的增加,预测依次叠叠叠叠叠。为了评价IMA4VP的有效性,我们对三个共同的未来基准预测数据集和天气气候预报基准数据集进行了实验。结果显示,我们提议的模型实现了状态。</s>