注意: 有效标示大型马拉松数据集, 用于环绕盒回退 (Heuristics2Annotate: Efficient Annotation of Large-Scale Marathon Dataset For Bounding Box Regression)

Annotating a large-scale in-the-wild person re-identification dataset especially of marathon runners is a challenging task. The variations in the scenarios such as camera viewpoints, resolution, occlusion, and illumination make the problem non-trivial. Manually annotating bounding boxes in such large-scale datasets is cost-inefficient. Additionally, due to crowdedness and occlusion in the videos, aligning the identity of runners across multiple disjoint cameras is a challenge. We collected a novel large-scale in-the-wild video dataset of marathon runners. The dataset consists of hours of recording of thousands of runners captured using 42 hand-held smartphone cameras and covering real-world scenarios. Due to the presence of crowdedness and occlusion in the videos, the annotation of runners becomes a challenging task. We propose a new scheme for tackling the challenges in the annotation of such large dataset. Our technique reduces the overall cost of annotation in terms of time as well as budget. We demonstrate performing fps analysis to reduce the effort and time of annotation. We investigate several annotation methods for efficiently generating tight bounding boxes. Our results prove that interpolating bounding boxes between keyframes is the most efficient method of bounding box generation amongst several other methods and is 3x times faster than the naive baseline method. We introduce a novel way of aligning the identity of runners in disjoint cameras. Our inter-camera alignment tool integrated with the state-of-the-art person re-id system proves to be sufficient and effective in the alignment of the runners across multiple cameras with non-overlapping views. Our proposed framework of annotation reduces the annotation cost of the dataset by a factor of 16x, also effectively aligning 93.64% of the runners in the cross-camera setting.

翻译：发现一个大型的动态人重新识别数据集,特别是马拉松跑跑者,这是一个具有挑战性的任务。相机视图、分辨率、封闭度和光化等情景的变异使问题变得非三角性。手工在大型数据集中注明捆绑的框是成本效率低下的。此外,由于视频的拥挤和隐蔽性,在多个脱节相机上调跑者的身份是一个挑战。我们收集了一个新型的大型螺旋摄像机,在马拉松跑跑跑者全自动视频数据集中收集了一个新的大型螺旋直线摄像机。这个数据集包括用42个手持智能手机相机记录数千名跑者记录并覆盖现实世界情景的时数。由于视频中存在拥挤和隐蔽性的框框,对跑者的说明是一项艰巨的任务。我们提出了一个新的办法,用以应对在多个脱节的相机上出现的挑战。我们的技术通过时间和预算来降低直线直线调整总体的成本。我们展示的紧凑分析,以降低努力度和最短时间的节流分析,在16个基底线的框中,我们用一个快速的方法来验证。