Time series forecasting is a crucial task in machine learning, as it has a wide range of applications including but not limited to forecasting electricity consumption, traffic, and air quality. Traditional forecasting models rely on rolling averages, vector auto-regression and auto-regressive integrated moving averages. On the other hand, deep learning and matrix factorization models have been recently proposed to tackle the same problem with more competitive performance. However, one major drawback of such models is that they tend to be overly complex in comparison to traditional techniques. In this paper, we report the results of prominent deep learning models with respect to a well-known machine learning baseline, a Gradient Boosting Regression Tree (GBRT) model. Similar to the deep neural network (DNN) models, we transform the time series forecasting task into a window-based regression problem. Furthermore, we feature-engineered the input and output structure of the GBRT model, such that, for each training window, the target values are concatenated with external features, and then flattened to form one input instance for a multi-output GBRT model. We conducted a comparative study on nine datasets for eight state-of-the-art deep-learning models that were presented at top-level conferences in the last years. The results demonstrate that the window-based input transformation boosts the performance of a simple GBRT model to levels that outperform all state-of-the-art DNN models evaluated in this paper.
翻译:在机器学习中,时间序列预测是一项关键的任务,因为其应用范围广泛,包括但不限于预测电力消耗、交通和空气质量。传统预测模型依靠滚动平均数、矢量自动反向和自动反向综合移动平均数。另一方面,最近提出了深层次学习和矩阵因素化模型,以解决同样的问题,提高业绩竞争力。然而,这些模型的一个主要缺点是,它们往往与传统技术相比过于复杂。在本文中,我们报告了在众所周知的机器学习基线、 " 渐进式推进回溯树 " (GRT) 模型方面突出的深层次学习模型的结果。与深层神经网络(DNNN)模型相似,我们把时间序列预测任务转化为基于窗口的回归问题。此外,我们根据特征设计了GBRT模型的投入和产出结构,因此,每个培训窗口的目标值都与外部特征相融合,然后被固定为多输出的GBRT纸型模型(GRT)的 " 渐进式推进树 " 模型(GBRT)模型) 。我们在9年的深度进化进化会议中进行了一项比较研究,在8年的进取结果上展示了所有状态,在8年的进取模式中展示了8年的最高进式。