Multivariate time series forecasting poses challenges as the variables are intertwined in time and space, like in the case of traffic signals. Defining signals on graphs relaxes such complexities by representing the evolution of signals over a space using relevant graph kernels such as the heat diffusion kernel. However, this kernel alone does not fully capture the actual dynamics of the data as it only relies on the graph structure. The gap can be filled by combining the graph kernel representation with data-driven models that utilize historical data. This paper proposes a traffic propagation model that merges multiple heat diffusion kernels into a data-driven prediction model to forecast traffic signals. We optimize the model parameters using Bayesian inference to minimize the prediction errors and, consequently, determine the mixing ratio of the two approaches. Such mixing ratio strongly depends on training data size and data anomalies, which typically correspond to the peak hours for traffic data. The proposed model demonstrates prediction accuracy comparable to that of the state-of-the-art deep neural networks with lower computational effort. It notably achieves excellent performance for long-term prediction through the inheritance of periodicity modeling in data-driven models.
翻译:多变时间序列预测带来了挑战,因为变量在时间和空间上相互交织,如交通信号。在图形上确定信号,通过使用热扩散内核等相关图形内核代表空间信号的演变,从而放松了这种复杂性。然而,单靠这个内核并不能完全捕捉数据的实际动态,因为它只依靠图形结构。可以通过将图形内核代表与使用历史数据的数据驱动模型相结合来填补差距。本文建议采用交通传播模型,将多个热扩散内核结合成数据驱动预测信号的预测模型。我们利用Bayesian推论优化模型参数,以尽量减少预测错误,从而确定两种方法的混合比率。这种混合比率在很大程度上取决于培训数据大小和数据异常,通常与交通数据的高峰时数相对应。拟议模型显示的预测准确性可与利用历史数据驱动模型的先进深度神经网络相比,而且计算努力较低。通过数据驱动模型的周期性模型的继承性模型,尤其能够取得长期预测的优异性。