We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment and loop detection in simultaneous localization and mapping. The loop detection sub-task is especially relevant when a robot with an on-board RGB-D camera can drive past the same place (``Point") at different angles. The dataset is based on the popular Habitat simulator, in which it is possible to generate photorealistic indoor scenes using both own sensor data and open datasets, such as Matterport3D. To study the main stages of solving the place recognition problem on the HPointLoc dataset, we proposed a new modular approach named as PNTR. It first performs an image retrieval with the Patch-NetVLAD method, then extracts keypoints and matches them using R2D2, LoFTR or SuperPoint with SuperGlue, and finally performs a camera pose optimization step with TEASER++. Such a solution to the place recognition problem has not been previously studied in existing publications. The PNTR approach has shown the best quality metrics on the HPointLoc dataset and has a high potential for real use in localization systems for unmanned vehicles. The proposed dataset and framework are publicly available: https://github.com/metra4ok/HPointLoc.
翻译:我们展示了名为HPointLoc的新颖数据集,专门设计该数据集是为了探索室内环境中视觉定位识别能力和同步定位和绘图时环比探测环比检测能力。当机载 RGB-D 相机的机器人可以在不同角度驱动同一位置(“Point”)时,环比探测子任务特别相关。该数据集以流行的生境模拟器为基础,该模拟器可以使用自己的传感器数据和开放数据集(如Meatterport3D)生成光真室内场景。为了研究在HPointLoc数据集中解决位置识别问题的主要阶段,我们提出了名为PNTR的新的模块化方法。它首先使用 Patch-NetVLAD 方法进行图像检索,然后提取关键点并用 R2D2、 LoFTR 或SuperPoint 和 SuperGlueGlue 来匹配它们,最后可以使用TEASER++。这样的地点识别问题解决方案以前没有在现有的出版物中研究过。PNTR 方法已经展示了HPOLOV/MOD 数据系统上的最佳质量衡量标准。