Traffic monitoring cameras are powerful tools for traffic management and essential components of intelligent road infrastructure systems. In this paper, we present a vehicle localization and traffic scene reconstruction framework using these cameras, dubbed as CAROM, i.e., "CARs On the Map". CAROM processes traffic monitoring videos and converts them to anonymous data structures of vehicle type, 3D shape, position, and velocity for traffic scene reconstruction and replay. Through collaborating with a local department of transportation in the United States, we constructed a benchmarking dataset containing GPS data, roadside camera videos, and drone videos to validate the vehicle tracking results. On average, the localization error is approximately 0.8 m and 1.7 m within the range of 50 m and 120 m from the cameras, respectively.
翻译:交通监控摄像头是交通管理的有力工具,是智能道路基础设施系统的基本组成部分。在本文中,我们用这些被称为CAROM的相机,即“地图上的CARS”。CARM处理交通监控视频,并将其转换为车辆类型、3D形状、位置和交通现场重建和重播速度等匿名数据结构。我们与美国当地交通部门合作,建立了一个基准数据集,其中包括全球定位系统数据、路边摄像机视频和无人机视频,以验证车辆跟踪结果。平均而言,定位错误在距离摄像头50米和120米的距离内大约为0.8米和1.7米。