Amounts of historical data collected increase and business intelligence applicability with automatic forecasting of time series are in high demand. While no single time series modeling method is universal to all types of dynamics, forecasting using an ensemble of several methods is often seen as a compromise. Instead of fixing ensemble diversity and size, we propose to predict these aspects adaptively using meta-learning. Meta-learning here considers two separate random forest regression models, built on 390 time-series features, to rank 22 univariate forecasting methods and recommend ensemble size. The forecasting ensemble is consequently formed from methods ranked as the best, and forecasts are pooled using either simple or weighted average (with a weight corresponding to reciprocal rank). The proposed approach was tested on 12561 micro-economic time-series (expanded to 38633 for various forecasting horizons) of M4 competition where meta-learning outperformed Theta and Comb benchmarks by relative forecasting errors for all data types and horizons. Best overall results were achieved by weighted pooling with a symmetric mean absolute percentage error of 9.21% versus 11.05% obtained using the Theta method.
翻译:虽然没有单一的时间序列模型方法适用于所有类型的动态,但使用几种方法的组合组合往往被视为一种折中办法。我们提议不利用元学习来预测这些方面,而是用元学习来适应这些方面。元学习在这里考虑两个独立的随机森林回归模型,以390个时间序列特征为基础,将22个单词序列预测方法排在第一级,并建议共同体大小。预测元体由排名为最佳的方法组成,预测是使用简单或加权平均(重量与对等级别相对)集合的。提议的方法在M4的12561个微观经济时间序列(各种预测地平面的计算为38633)上进行了测试,在M4的竞争中,元学习超过Theta和Comb基准,通过所有数据类型和地平面的相对预测错误而实现。通过加权集合,得出最佳的总体结果是,对准平均百分比为9.21%,对11.05 %,使用Theta方法获得的绝对百分比。