Large collections of time series data are commonly organized into structures with different levels of aggregation; examples include product and geographical groupings. It is often important to ensure that the forecasts are coherent so that the predicted values at disaggregate levels add up to the aggregate forecast. The growing interest of the Machine Learning community in hierarchical forecasting systems indicates that we are in a propitious moment to ensure that scientific endeavors are grounded on sound baselines. For this reason, we put forward the HierarchicalForecast library, which contains preprocessed publicly available datasets, evaluation metrics, and a compiled set of statistical baseline models. Our Python-based reference framework aims to bridge the gap between statistical and econometric modeling, and Machine Learning forecasting research. Code and documentation are available in https://github.com/Nixtla/hierarchicalforecast.
翻译:大量时间序列数据的收集通常由不同层次的汇总结构组成;例子包括产品和地理分组;通常必须确保预测的一致性,以便分类层次的预测值与总预测值相加;机器学习界对等级预报系统的兴趣日益浓厚,这表明我们正处在确保科学努力以健全的基线为基础的良好时刻;为此,我们提出了等级化的前瞻性图书馆,其中包括预先处理的公开数据集、评价指标和一套汇编的统计基线模型;我们的Python参考框架旨在弥合统计和计量经济学模型与机器学习预测研究之间的差距;守则和文件可在https://github.com/Nixtla/hierarchicforestricast查阅。