Given an increasingly volatile climate, the relationship between weather and transit ridership has drawn increasing interest. However, challenges stemming from spatio-temporal dependency and non-stationarity have not been fully addressed in modelling and predicting transit ridership under the influence of weather conditions especially with the traditional statistical approaches. Drawing on three-month smart card data in Brisbane, Australia, this research adopts and assesses a suite of machine-learning algorithms, i.e., random forest, eXtreme Gradient Boosting (XGBoost) and Tweedie XGBoost, to model and predict near real-time bus ridership in relation to sudden change of weather conditions. The study confirms that there indeed exists a significant level of spatio-temporal variability of weather-ridership relationship, which produces equally dynamic patterns of prediction errors. Further comparison of model performance suggests that Tweedie XGBoost outperforms the other two machine-learning algorithms in generating overall more accurate prediction outcomes in space and time. Future research may advance the current study by drawing on larger data sets and applying more advanced machine and deep-learning approaches to provide more enhanced evidence for real-time operation of transit systems.
翻译:在气候日益动荡的情况下,天气和过境骑兵之间的关系引起了越来越多的兴趣,然而,在模拟和预测天气条件影响下,特别是在传统统计方法下,在模拟和预测过境骑兵时空依赖性和不常态性方面,尚未充分解决由地表-时间依赖性和非常态性所带来的挑战。利用澳大利亚布里斯班三个月的智能卡数据,这项研究采纳和评估了一套机器学习算法,即随机森林、eXtreme Gradiential boysting (XGBoost)和Tweedie XGBoost, 以模拟和预测天气条件突然变化的近实时客车载载运。研究证实,确实存在着相当高水平的天气搭乘关系时空变异性,从而产生同样动态的预测错误模式。进一步比较表明,Tweedie XGBoust在产生总体更准确的空间和时间预测结果方面,超越了另外两种机器学习算法。未来研究可能推进目前的研究,办法是利用更大型的数据集,采用更先进的机器和深层的过渡学习方法。