Evaluating predictive models is a crucial task in predictive analytics. This process is especially challenging with time series data where the observations show temporal dependencies. Several studies have analysed how different performance estimation methods compare with each other for approximating the true loss incurred by a given forecasting model. However, these studies do not address how the estimators behave for model selection: the ability to select the best solution among a set of alternatives. We address this issue and compare a set of estimation methods for model selection in time series forecasting tasks. We attempt to answer two main questions: (i) how often is the best possible model selected by the estimators; and (ii) what is the performance loss when it does not. We empirically found that the accuracy of the estimators for selecting the best solution is low, and the overall forecasting performance loss associated with the model selection process ranges from 1.2% to 2.3%. We also discovered that some factors, such as the sample size, are important in the relative performance of the estimators.
翻译:预测模型评估是预测分析中的一项关键任务。 在观测显示时间依赖性的时间序列数据中,这一过程特别具有挑战性。 一些研究分析了不同性能估计方法在接近某一预测模型造成的真正损失方面如何相互比较。 但是,这些研究并没有涉及估计者在模型选择方面的行为方式:在一组备选方法中选择最佳解决办法的能力。我们讨论了这一问题,比较了一套用于在时间序列预测任务中选择模型的估算方法。我们试图回答两个主要问题:(一) 估计者选择的最佳可能模型的频率;以及(二) 如果没有,什么是性能损失。我们从经验上发现,选择最佳解决方案的估算者的准确性很低,与模型选择过程相关的总体性能预测损失从1.2%到2.3%不等。我们还发现,一些因素,例如抽样大小,对于估计者的相对性能很重要。