Providing forecasts for ultra-long time series plays a vital role in various activities, such as investment decisions, industrial production arrangements, and farm management. This paper develops a novel distributed forecasting framework to tackle challenges associated with forecasting ultra-long time series by using the industry-standard MapReduce framework. The proposed model combination approach facilitates distributed time series forecasting by combining the local estimators of time series models delivered from worker nodes and minimizing a global loss function. In this way, instead of unrealistically assuming the data generating process (DGP) of an ultra-long time series stays invariant, we make assumptions only on the DGP of subseries spanning shorter time periods. We investigate the performance of the proposed approach with AutoRegressive Integrated Moving Average (ARIMA) models using the real data application as well as numerical simulations. Compared to directly fitting the whole data with ARIMA models, our approach results in improved forecasting accuracy and computational efficiency both in point forecasts and prediction intervals, especially for longer forecast horizons. Moreover, we explore some potential factors that may affect the forecasting performance of our approach.
翻译:对超长时间序列的预测在投资决策、工业生产安排和农场管理等各种活动中发挥着关键作用。本文件开发了一个新的分布式预测框架,以应对利用工业标准地图显示框架预测超长时间序列的挑战。拟议模型组合法将工人节点提供的时间序列模型的当地估计数与全球损失功能的最小化结合起来,从而便利分配时间序列的预测。这样,我们不是不切实际地假设超长时间序列停留在变异状态的数据产生过程(DGP),而只是对跨越较短时期的子系列的DGP作出假设。我们利用实际数据应用和数字模拟,调查拟议的自动递减综合平均移动模型(ARIMA)模型的性能。相比于直接将整个数据与ARIMA模型相匹配,我们的方法在点预测和预测间隔中提高了预测的准确性和计算效率,特别是在较长的预测前景方面。我们探索了可能影响我们方法预测业绩的一些潜在因素。