We propose a novel approach to the problem of clustering hierarchically aggregated time-series data, which has remained an understudied problem though it has several commercial applications. We first group time series at each aggregated level, while simultaneously leveraging local and global information. The proposed method can cluster hierarchical time series (HTS) with different lengths and structures. For common two-level hierarchies, we employ a combined objective for local and global clustering over spaces of discrete probability measures, using Wasserstein distance coupled with Soft-DTW divergence. For multi-level hierarchies, we present a bottom-up procedure that progressively leverages lower-level information for higher-level clustering. Our final goal is to improve both the accuracy and speed of forecasts for a larger number of HTS needed for a real-world application. To attain this goal, each time series is first assigned the forecast for its cluster representative, which can be considered as a "shrinkage prior" for the set of time series it represents. Then this base forecast can be quickly fine-tuned to adjust to the specifics of that time series. We empirically show that our method substantially improves performance in terms of both speed and accuracy for large-scale forecasting tasks involving much HTS.
翻译:我们建议对按等级汇总的时间序列数据进行分组问题采取新颖的办法,尽管它具有若干商业应用,但仍然是一个研究不足的问题。我们首先在每合计一级分组时间序列,同时利用当地和全球信息。拟议的方法可以将不同长度和结构的等级时间序列(HTS)分组。对于共同的两级等级分类,我们用瓦瑟斯坦距离和软体-DTW差异等不同时间序列的离散概率测量空间对本地和全球分组采用一个综合目标。对于多层次的等级,我们提出了一个自下而上的程序,逐步利用较低层次的信息进行更高层次的分组。我们的最终目标是提高实际应用所需的更多HTS的预测的准确性和速度。为了实现这一目标,每个时间序列首先为其分组代表分配预报,该预测可被视为一组时间序列的“预先缩小”。然后,这一基础预测可以迅速进行微调,以适应该时间序列的具体内容。我们从经验上表明,我们的方法大大改进了在速度和大规模任务方面涉及精度的预测。