Traffic bottlenecks are a set of road segments that have an unacceptable level of traffic caused by a poor balance between road capacity and traffic volume. A huge volume of trajectory data which captures real-time traffic conditions in road networks provides promising new opportunities to identify the traffic bottlenecks. In this paper, we define this problem as trajectory-driven traffic bottleneck identification: Given a road network R, a trajectory database T , find a representative set of seed edges of size K of traffic bottlenecks that influence the highest number of road segments not in the seed set. We show that this problem is NP-hard and propose a framework to find the traffic bottlenecks as follows. First, a traffic spread model is defined which represents changes in traffic volume for each road segment over time. Then, the traffic diffusion probability between two connected segments and the residual ratio of traffic volume for each segment can be computed using historical trajectory data. We then propose two different algorithmic approaches to solve the problem. The first one is a best-first algorithm BF, with an approximation ratio of 1-1/e. To further accelerate the identification process in larger datasets, we also propose a sampling-based greedy algorithm SG. Finally, comprehensive experiments using three different datasets compare and contrast various solutions, and provide insights into important efficiency and effectiveness trade-offs among the respective methods.
翻译:交通瓶颈是一系列道路瓶颈,由于公路能力和交通量之间的不平衡造成交通量不平衡,造成交通量水平令人无法接受。大量轨迹数据反映了公路网络的实时交通状况,为查明交通瓶颈提供了充满希望的新机会。在本文件中,我们将这一问题定义为由轨迹驱动的交通瓶颈识别:鉴于公路网络R,轨迹数据库T,我们提出一套具有代表性的K号交通瓶颈的种子边缘,这种边緣影响着不在种子集中的最大数量的道路段。我们表明,这一问题是难以解决的,并提出一个框架,以找出交通瓶颈如下:第一,确定交通扩展模式,表明每个路段的交通量随时间变化。然后,利用历史轨迹数据计算两个连接部分之间的交通传播概率和每个段的交通流量剩余比率。我们然后提出两种不同的算法来解决该问题。第一个是最佳的算法,其近似1-1/e比率。为了进一步加快大数据集的识别进程,我们还提议采用基于抽样的贪婪算法和不同数据方法,以比较重要的贸易效率。