Visual place recognition (VPR) is critical in not only localization and mapping for autonomous driving vehicles, but also in assistive navigation for the visually impaired population. To enable a long-term VPR system on a large scale, several challenges need to be addressed. First, different applications could require different image view directions, such as front views for self-driving cars while side views for the low vision people. Second, VPR in metropolitan scenes can often cause privacy concerns due to the imaging of pedestrian and vehicle identity information, calling for the need for data anonymization before VPR queries and database construction. Both factors could lead to VPR performance variations that are not well understood yet. To study their influences, we present the NYU-VPR dataset that contains more than 200,000 images over a 2km by 2km area near the New York University campus, taken within the whole year of 2016. We present benchmark results on several popular VPR algorithms showing that side views are significantly more challenging for current VPR methods while the influence of data anonymization is almost negligible, together with our hypothetical explanations and in-depth analysis.
翻译:视觉定位不仅对自主驾驶车辆的本地化和绘图至关重要,而且对视力受损人群的辅助导航也至关重要。为了能够建立长期VPR系统,需要应对若干挑战。首先,不同的应用程序可能需要不同的图像视图方向,如自驾驶汽车的前视图和低视力人群的侧视图。第二,大都市场景中的VPR往往会由于行人和车辆身份信息的成像而引起隐私问题,要求在VPR查询和数据库建设之前需要数据匿名。这两种因素都可能导致VPR性能的变异,而这种变异还不能很好地理解。为了研究它们的影响,我们提出了NYU-VPR数据集,该数据集包含20万张以上摄氏2公里至2千米的图像,是2016年全年在纽约大学校园附近拍摄的。我们介绍了几个广受欢迎的VPR算法的基准结果,显示侧面观点对目前VPR方法的挑战更大,而数据匿名的影响几乎微不足道,同时我们假设的解释和深入分析。