Crowd counting on the drone platform is an interesting topic in computer vision, which brings new challenges such as small object inference, background clutter and wide viewpoint. However, there are few algorithms focusing on crowd counting on the drone-captured data due to the lack of comprehensive datasets. To this end, we collect a large-scale dataset and organize the Vision Meets Drone Crowd Counting Challenge (VisDrone-CC2020) in conjunction with the 16th European Conference on Computer Vision (ECCV 2020) to promote the developments in the related fields. The collected dataset is formed by $3,360$ images, including $2,460$ images for training, and $900$ images for testing. Specifically, we manually annotate persons with points in each video frame. There are $14$ algorithms from $15$ institutes submitted to the VisDrone-CC2020 Challenge. We provide a detailed analysis of the evaluation results and conclude the challenge. More information can be found at the website: \url{http://www.aiskyeye.com/}.
翻译:依靠无人机平台的人群是计算机愿景中一个有趣的话题,这带来了新的挑战,如小型物体推断、背景混乱和广观观点等。然而,由于缺乏全面的数据集,很少有侧重于人群对无人机采集的数据进行计数的算法。为此,我们收集了一个大型数据集,并结合促进相关领域发展的第16次欧洲计算机愿景会议(ECCV 2020),组织了一个大型的 " 愿景满足无人机计数挑战 " (VisDrone-CC202020),以促进相关领域的发展。所收集的数据集由3,360万美元的图像组成,包括用于培训的2,460美元图像和用于测试的900美元图像。具体地说,我们为每个视频框架中各点的人人工做了说明。我们从1,500美元的机构向VisDrone-CC20挑战提交了14美元的算法。我们详细分析了评估结果并结束了挑战。更多信息见网站:http://www.aiskeye.com/}。