Trip data that records each vehicle's trip activity on the road network describes the operation of urban traffic from the individual perspective, and it is extremely valuable for transportation research. However, restricted by data privacy, the trip data of individual-level cannot be opened for all researchers, while the need for it is very urgent. In this paper, we produce a city-scale synthetic individual-level vehicle trip dataset by generating for each individual based on the historical trip data, where the availability and trip data privacy protection are balanced. Privacy protection inevitably affects the availability of data. Therefore, we have conducted numerous experiments to demonstrate the performance and reliability of the synthetic data in different dimensions and at different granularities to help users properly judge the tasks it can perform. The result shows that the synthetic data is consistent with the real data (i.e., historical data) on the aggregated level and reasonable from the individual perspective.
翻译:记录公路网上每辆车旅行活动的特里普数据,从个人的角度描述城市交通的运行情况,对运输研究来说极为宝贵;然而,由于数据隐私的限制,无法向所有研究人员开放个人一级的旅行数据,尽管对这些数据的需求非常迫切;在本文件中,我们根据历史旅行数据为每个人编制一个城市规模的合成个人一级车辆旅行数据集,根据历史旅行数据为每个人编制数据,在数据提供和旅行数据隐私保护方面是平衡的;隐私保护不可避免地影响数据的提供;因此,我们进行了许多实验,以证明合成数据在不同层面和不同颗粒上的性能和可靠性,以帮助用户正确判断它能够完成的任务;结果显示,合成数据与个人总体水平和从个人角度看合理的真实数据(即历史数据)相一致。