In the context of time series forecasting, it is a common practice to evaluate multiple methods and choose one of these methods or an ensemble for producing the best forecasts. However, choosing among different ensembles over multiple methods remains a challenging task that undergoes a combinatorial explosion as the number of methods increases. In the context of demand forecasting or revenue forecasting, this challenge is further exacerbated by a large number of time series as well as limited historical data points available due to changing business context. Although deep learning forecasting methods aim to simultaneously forecast large collections of time series, they become challenging to apply in such scenarios due to the limited history available and might not yield desirable results. We propose a framework for forecasting short high-dimensional time series data by combining low-rank temporal matrix factorization and optimal model selection on latent time series using cross-validation. We demonstrate that forecasting the latent factors leads to significant performance gains as compared to directly applying different uni-variate models on time series. Performance has been validated on a truncated version of the M4 monthly dataset which contains time series data from multiple domains showing the general applicability of the method. Moreover, it is amenable to incorporating the analyst view of the future owing to the low number of latent factors which is usually impractical when applying forecasting methods directly to high dimensional datasets.
翻译:在时间序列预测方面,评价多种方法并选择其中一种方法或计算最佳预测的混合方法是一种常见做法,但是,在不同的组合中选择多种方法仍然是一项具有挑战性的任务,随着方法数量的增加,这种任务会发生组合爆炸;在需求预测或收入预测方面,由于业务环境的变化,大量的时间序列和有限的历史数据点使这一挑战更加严峻。虽然深层次学习预测方法的目的是同时预测大量的时间序列,但由于历史有限,可能无法产生理想的结果,因此难以在这种假设中应用这些方法。我们提出了一个框架,通过将低层次的时间矩阵乘数结合,预测短期时间序列的短时序数据,并采用最佳模型选择,随着方法的增加,在需求预测或收入预测方面,这种挑战会因大量的时间序列以及由于业务环境的变化而获得的有限历史数据点而进一步加剧。虽然深层次的学习预测方法旨在同时预测大量的时间序列,但是由于历史有限,因此难以在这种假设中应用时间序列数据,因此难以应用于这种假设。此外,在采用高层次数据时,通常会直接采用高层次的预测方法,因为高层次的预测方法是难以预测。