Intersection is one of the most challenging scenarios for autonomous driving tasks. Due to the complexity and stochasticity, essential applications (e.g., behavior modeling, motion prediction, safety validation, etc.) at intersections rely heavily on data-driven techniques. Thus, there is an intense demand for trajectory datasets of traffic participants (TPs) in intersections. Currently, most intersections in urban areas are equipped with traffic lights. However, there is not yet a large-scale, high-quality, publicly available trajectory dataset for signalized intersections. Therefore, in this paper, a typical two-phase signalized intersection is selected in Tianjin, China. Besides, a pipeline is designed to construct a Signalized INtersection Dataset (SIND), which contains 7 hours of recording including over 13,000 TPs with 7 types. Then, the behaviors of traffic light violations in SIND are recorded. Furthermore, the SIND is also compared with other similar works. The features of the SIND can be summarized as follows: 1) SIND provides more comprehensive information, including traffic light states, motion parameters, High Definition (HD) map, etc. 2) The category of TPs is diverse and characteristic, where the proportion of vulnerable road users (VRUs) is up to 62.6% 3) Multiple traffic light violations of non-motor vehicles are shown. We believe that SIND would be an effective supplement to existing datasets and can promote related research on autonomous driving.The dataset is available online via: https://github.com/SOTIF-AVLab/SinD
 翻译:路口是自主驾驶任务最具挑战性的情景之一。由于复杂和随机性,交叉路口的基本应用(例如行为模型、运动预测、安全验证等)在很大程度上依赖数据驱动技术。因此,对交通参与者(交通参与者)在交叉路口的轨迹数据集的需求很大。目前,城市地区的大多数交叉路口都装有交通灯;然而,尚没有大规模、高质量和公开的信号化十字路口轨道数据集。因此,在本文件中,中国天津选择了典型的两阶段信号化十字路口。此外,管道的设计是要建立一个信号化的内置数据集(SIND),其中包含7小时的记录,包括7类交通参与者(TP)的13 000多个数据集。然后,SIND违反交通灯的行为与其他类似的工作相比,SIND的特点可以概括如下:1)SIND提供更全面的信息,包括交通灯光状态、运动参数、高定义(HDGGL)驱动路段的数据集(SVI),其中显示易变的SVS-L)路段的不易变数据。