Time series modeling for predictive purpose has been an active research area of machine learning for many years. However, no sufficiently comprehensive and meanwhile substantive survey was offered so far. This survey strives to meet this need. A unified presentation has been adopted for entire parts of this compilation. A red thread guides the reader from time series preprocessing to forecasting. Time series decomposition is a major preprocessing task, to separate nonstationary effects (the deterministic components) from the remaining stochastic constituent, assumed to be stationary. The deterministic components are predictable and contribute to the prediction through estimations or extrapolation. Fitting the most appropriate model to the remaining stochastic component aims at capturing the relationship between past and future values, to allow prediction. We cover a sufficiently broad spectrum of models while nonetheless offering substantial methodological developments. We describe three major linear parametric models, together with two nonlinear extensions, and present five categories of nonlinear parametric models. Beyond conventional statistical models, we highlight six categories of deep neural networks appropriate for time series forecasting in nonlinear framework. Finally, we enlighten new avenues of research for time series modeling and forecasting. We also report software made publicly available for the models presented.
翻译:多年来,为预测目的进行时间序列建模一直是机器学习的一个积极研究领域,但迄今没有提供足够全面和同时具有实质意义的调查。本调查力求满足这一需要。本汇编的整个部分采用了统一的表述方式。红线线指导读者从时间序列预处理到预测。时间序列分解是一项主要的预处理任务,目的是将非静止效应(确定性组成部分)与假定为固定的其余的随机成份区分开来。确定性组成部分是可以预测的,有助于通过估计或外推进行预测。将最合适的模型适用于其余的随机成份,目的是捕捉到过去和将来价值之间的关系,以便进行预测。我们覆盖了足够广泛的模型,同时提供了大量的方法发展。我们描述了三种主要的线性参数模型,连同两个非线性扩展,并介绍了五类非线性参数模型。除了传统的统计模型外,我们还着重介绍了适合于非线性框架的时间序列预报的深神经网络的六类。最后,我们为时间序列模型提供了新的研究途径。我们还公开介绍了可供使用的软件。