Statistical methods such as the Box-Jenkins method for time-series forecasting have been prominent since their development in 1970. Many researchers rely on such models as they can be efficiently estimated and also provide interpretability. However, advances in machine learning research indicate that neural networks can be powerful data modeling techniques, as they can give higher accuracy for a plethora of learning problems and datasets. In the past, they have been tried on time-series forecasting as well, but their overall results have not been significantly better than the statistical models especially for intermediate length times series data. Their modeling capacities are limited in cases where enough data may not be available to estimate the large number of parameters that these non-linear models require. This paper presents an easy to implement data augmentation method to significantly improve the performance of such networks. Our method, Augmented-Neural-Network, which involves using forecasts from statistical models, can help unlock the power of neural networks on intermediate length time-series and produces competitive results. It shows that data augmentation, when paired with Automated Machine Learning techniques such as Neural Architecture Search, can help to find the best neural architecture for a given time-series. Using the combination of these, demonstrates significant enhancement in the forecasting accuracy of three neural network-based models for a COVID-19 dataset, with a maximum improvement in forecasting accuracy by 21.41%, 24.29%, and 16.42%, respectively, over the neural networks that do not use augmented data.
翻译:1970年开发时序预报的Box-Jenkins方法等统计方法,自1970年开发以来就一直十分突出。许多研究人员依赖这些模型,因为它们可以高效地估计和提供可解释性。然而,机器学习研究的进展表明,神经网络可以成为强大的数据模型技术,因为它们可以使过多的学习问题和数据集具有更高的准确性。过去,它们也是在时间序列预测方面尝试过的,但它们的总体结果并没有比统计模型(特别是中间时间序列数据)好得多。如果没有足够的数据来估计这些非线性模型所需要的大量参数,它们的模型能力有限。然而,本文为大大改进这些网络的性能提供了方便。我们的方法,即强化神经网络,它涉及利用统计模型的预测,有助于释放中长时间序列的神经网络的力量,并产生竞争性的结果。当它们与诸如神经建筑搜索等自动化机器学习技术结合时,它们的模型的建模能力是有限的,在无法估计这些非线型模型的大量参数的情况下,可以找到最佳的神经网络增强方法,从而大大地改进了这些网络的精确性结构。使用这些系统模型,用来显示一个显著的提高的21号数据序列中的系统。使用这些组合,用来测量结构的精确性模型,用来显示一个显著的精确度结构结构结构。