Time series data appears in a variety of applications such as smart transportation and environmental monitoring. One of the fundamental problems for time series analysis is time series forecasting. Despite the success of recent deep time series forecasting methods, they require sufficient observation of historical values to make accurate forecasting. In other words, the ratio of the output length (or forecasting horizon) to the sum of the input and output lengths should be low enough (e.g., 0.3). As the ratio increases (e.g., to 0.8), the uncertainty for the forecasting accuracy increases significantly. In this paper, we show both theoretically and empirically that the uncertainty could be effectively reduced by retrieving relevant time series as references. In the theoretical analysis, we first quantify the uncertainty and show its connections to the Mean Squared Error (MSE). Then we prove that models with references are easier to learn than models without references since the retrieved references could reduce the uncertainty. To empirically demonstrate the effectiveness of the retrieval based time series forecasting models, we introduce a simple yet effective two-stage method, called ReTime consisting of a relational retrieval and a content synthesis. We also show that ReTime can be easily adapted to the spatial-temporal time series and time series imputation settings. Finally, we evaluate ReTime on real-world datasets to demonstrate its effectiveness.
翻译:时间序列数据出现在智能运输和环境监测等各种应用中。时间序列分析的根本问题之一是时间序列预测。尽管最近深入的时间序列预测方法取得了成功,但是,它们要求对历史价值进行充分的观察,以便作出准确的预测。换句话说,产出长度(或预测地平线)与投入和产出长度之和之和的比率应该足够低(例如0.3)。随着比率的增加(例如,升至0.8),预测准确性的不确定性大大增加。在本文中,我们从理论上和经验上都表明,通过检索相关时间序列作为参考,不确定性可以有效减少。在理论分析中,我们首先量化不确定性,并显示其与中位偏差(MSE)的关联。然后,我们证明,由于检索参考的参考文献可以减少不确定性,因此,参考的模型比模型更容易学习。为了从经验上证明检索基于时间序列的预测模型的有效性,我们采用了简单而有效的两阶段方法,称为ReTime,由关联的检索和内容序列组成。我们还表明,ReTimeal能够很容易地将不确定性量化并显示其空间-时间序列。我们最后展示了时间-时间序列。