The predictive advantage of combining several different predictive models is widely accepted. Particularly in time series forecasting problems, this combination is often dynamic to cope with potential non-stationary sources of variation present in the data. Despite their superior predictive performance, ensemble methods entail two main limitations: high computational costs and lack of transparency. These issues often preclude the deployment of such approaches, in favour of simpler yet more efficient and reliable ones. In this paper, we leverage the idea of model compression to address this problem in time series forecasting tasks. Model compression approaches have been mostly unexplored for forecasting. Their application in time series is challenging due to the evolving nature of the data. Further, while the literature focuses on neural networks, we apply model compression to distinct types of methods. In an extensive set of experiments, we show that compressing dynamic forecasting ensembles into an individual model leads to a comparable predictive performance and a drastic reduction in computational costs. Further, the compressed individual model with best average rank is a rule-based regression model. Thus, model compression also leads to benefits in terms of model interpretability. The experiments carried in this paper are fully reproducible.
翻译:将若干不同的预测模型结合起来的预测优势被广泛接受。特别是在时间序列预测问题中,这种组合往往具有动态性,能够应对数据中潜在的非静止变化源。尽管其预测性表现优异,但混合方法具有两个主要的局限性:高计算成本和缺乏透明度。这些问题往往妨碍采用这类方法,而采用更简单、更高效、更可靠的方法。在本文件中,我们利用模型压缩的构想来在时间序列预测任务中解决这一问题。模型压缩方法大多没有用于预测。由于数据的性质不断变化,它们在时间序列中的应用具有挑战性。此外,虽然文献侧重于神经网络,但我们将模型压缩应用于不同类型的方法。在一系列广泛的实验中,我们表明将动态预测组合成单个模型会导致类似的预测性业绩和大幅降低计算成本。此外,具有最高平均等级的压缩个人模型是一种基于规则的回归模型。因此,模型压缩也会导致模型可解释性模型的效益。在本文中进行的实验是完全可逆的。