The availability of tourism-related big data increases the potential to improve the accuracy of tourism demand forecasting, but presents significant challenges for forecasting, including curse of dimensionality and high model complexity. A novel bagging-based multivariate ensemble deep learning approach integrating stacked autoencoders and kernel-based extreme learning machines (B-SAKE) is proposed to address these challenges in this study. By using historical tourist arrival data, economic variable data and search intensity index (SII) data, we forecast tourist arrivals in Beijing from four countries. The consistent results of multiple schemes suggest that our proposed B-SAKE approach outperforms benchmark models in terms of level accuracy, directional accuracy and even statistical significance. Both bagging and stacked autoencoder can effectively alleviate the challenges brought by tourism big data and improve the forecasting performance of the models. The ensemble deep learning model we propose contributes to tourism forecasting literature and benefits relevant government officials and tourism practitioners.
翻译:与旅游有关的大数据的提供增加了提高旅游需求预测准确性的潜力,但是对预测提出了重大挑战,包括维度和高模型复杂性的诅咒。为了应对这一挑战,建议在本研究中采用新的包状多变的多层次深层学习方法,将堆叠的自动电解器和内核的极端学习机(B-SAK)结合起来。我们利用历史旅游抵达数据、经济变量数据和搜索强度指数(SII)数据,预测来自四个国家的游客抵达北京。多种办法的一致结果表明,我们拟议的B-SAK方法在水平准确性、方向准确性、甚至统计意义方面优于基准模型。包状和堆叠的自动电算器可以有效地减轻旅游业大数据带来的挑战,改善模型的预测性能。我们提出的堆积式深学习模型有助于旅游预测文献,有利于相关政府官员和旅游业从业人员。