Visual Place Recognition (VPR) in areas with visually similar scenes, such as urban or indoor scenarios, is a major challenge. Existing VPR methods based on global descriptors have difficulty capturing locally specific regions (LSRs) in a scene and are therefore prone to localization confusion in such scenarios. Finding the LSRs that are critical for place recognition thus becomes key. To address this challenge, we introduce Patch-NetVLAD+, inspired by patch-based VPR research. Our method proposes a fine-tuning strategy with a triplet loss to make NetVLAD suitable for extracting patch-level descriptors. Moreover, unlike existing methods that treat all patches in an image equally, our method identifies patches from LSRs, which appear less frequently throughout the dataset, and makes them play an important role in VPR by assigning them higher weights. Experiments on the Pittsburgh30k and Tokyo247 datasets show that our approach achieves up to a 6.35\% performance improvement over existing patch-based methods.
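The two ingredients named above can be sketched as follows. This is a minimal illustrative sketch, not the paper's exact formulation: the function names, the Euclidean-distance triplet loss, and the inverse-frequency weighting scheme are all assumptions standing in for the actual Patch-NetVLAD+ pipeline.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.1):
    # Hinge-style triplet loss on patch descriptors: pull the positive
    # (same place) closer to the anchor than the negative (different place)
    # by at least `margin`. The margin value here is illustrative.
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(d_pos - d_neg + margin, 0.0)

def rarity_weights(patch_assignments):
    # Assumed inverse-frequency weighting: patches whose (hypothetical)
    # cluster label appears less often across the dataset are treated as
    # locally specific regions and receive higher weight.
    counts = np.bincount(patch_assignments)
    freq = counts[patch_assignments] / len(patch_assignments)
    w = 1.0 / freq
    return w / w.sum()  # normalize so the weights sum to 1
```

For example, with cluster assignments `[0, 0, 0, 1]` the lone patch of cluster 1 receives half of the total weight, so a rare (and presumably discriminative) region dominates the aggregated image descriptor.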