Reliability plays a key role in the experience of a rail traveler. The reliability of journeys involving transfers is affected by the reliability of the transfers and the consequences of missing a transfer, as well as the possible delay of the train used to reach the destination. In this paper, we propose a flexible method to model the reliability of train journeys with any number of transfers. The method combines a transfer reliability model based on gradient boosting responsible for predicting the reliability of transfers between trains and a delay model based on probabilistic Bayesian regression, which is used to model train arrival delays. The models are trained on delay data from four Swedish train stations and evaluated on delay data from another two stations, in order to evaluate the generalization performance of the models. We show that the probabilistic delay model, which models train delays following a mixture distribution with two lognormal components, allows to much more realistically model the distribution of actual train delays compared to a standard lognormal model. Finally, we show how these models can be used together to sample the arrival delay at the final destination of the entire journey. The results indicate that the method accurately predicts the reliability for nine out of ten tested journeys. The method could be used to improve journey planners by providing reliability information to travelers. Further applications include timetable planning and transport modeling.
翻译:可靠性在铁路旅客的出行体验中起着关键作用。涉及换乘的行程可靠性受到换乘可靠性、错过换乘的后果以及用于抵达目的地的列车可能延误的影响。本文提出了一种灵活的方法来建模任意换乘次数的列车行程可靠性。该方法结合了基于梯度提升的换乘可靠性模型(负责预测列车间换乘的可靠性)和基于概率贝叶斯回归的延误模型(用于建模列车到达延误)。模型使用瑞典四个火车站的延误数据进行训练,并在另外两个车站的延误数据上进行评估,以检验模型的泛化性能。研究表明,采用双对数正态分量混合分布建模列车延误的概率延误模型,相比标准对数正态模型能更真实地反映实际列车延误的分布特征。最后,我们展示了如何将这些模型结合使用以抽样整个行程最终目的地的到达延误。结果表明,该方法对十分之九的测试行程能准确预测可靠性。该方法可用于改进行程规划系统,为旅客提供可靠性信息。进一步的应用包括时刻表规划和交通建模。