Statistical methods such as the Box-Jenkins method for time-series forecasting have been prominent since their development in 1970. Many researchers rely on such models as they can be efficiently estimated and also provide interpretability. However, advances in machine learning research indicate that neural networks can be powerful data modeling techniques, as they can give higher accuracy for a plethora of learning problems and datasets. In the past, they have been tried on time-series forecasting as well, but their overall results have not been significantly better than the statistical models especially for intermediate length times series data. Their modeling capacities are limited in cases where enough data may not be available to estimate the large number of parameters that these non-linear models require. This paper presents an easy to implement data augmentation method to significantly improve the performance of such networks. Our method, Augmented-Neural-Network, which involves using forecasts from statistical models, can help unlock the power of neural networks on intermediate length time-series and produces competitive results. It shows that data augmentation, when paired with Automated Machine Learning techniques such as Neural Architecture Search, can help to find the best neural architecture for a given time-series. Using the combination of these, demonstrates significant enhancement for two configurations of our technique for a COVID-19 dataset, improving forecasting accuracy by 19.90% and 11.43%, respectively, over the neural networks that do not use augmented data.
翻译:1970年开发时序预报的Box-Jenkins方法等统计方法,自1970年开发以来就一直十分突出。许多研究人员依赖这些模型,因为它们可以高效地估计,并且提供可解释性。然而,机器学习研究的进展表明,神经网络可以成为强大的数据模型技术,因为它们可以使大量学习问题和数据集具有更高的准确性。过去,它们也是在时间序列预测方面尝试过的,但它们的总体结果并没有比统计模型(特别是中间时间序列数据)好得多。如果没有足够的数据来估计这些非线性模型所需要的大量参数,那么它们的模型能力是有限的。而如果这些非线性模型可能不具备足够的数据来估计这些参数的大量参数,那么,它们就很容易执行数据增强方法来大大改进这类网络的性能。我们的方法,即增强神经网络的精确度,这涉及到使用统计模型的预测,有助于在中间时间序列上解开神经网络的力量,并产生竞争性的结果。它表明,当它们与诸如神经结构搜索等自动化机器学习技术结合时,它们的建模能力是有限的,它们能够帮助找到最佳的神经结构结构结构结构结构结构结构结构结构,从而大大改进这类网络,从而大大改进这种系统的精确性地展示了我们的11-19级数据结构。使用这些结构。使用这些结构的扩大技术。使用这些结构的扩大了一种相当的系统结构。使用这些系统结构,用这些系统来展示了一种重要的系统来显示的扩大的扩大的系统。