使用极强JPEG-压缩图像确认数据效率高的视觉位置</s> (Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images)

Visual Place Recognition (VPR) is the ability of a robotic platform to correctly interpret visual stimuli from its on-board cameras in order to determine whether it is currently located in a previously visited place, despite different viewpoint, illumination and appearance changes. JPEG is a widely used image compression standard that is capable of significantly reducing the size of an image at the cost of image clarity. For applications where several robotic platforms are simultaneously deployed, the visual data gathered must be transmitted remotely between each robot. Hence, JPEG compression can be employed to drastically reduce the amount of data transmitted over a communication channel, as working with limited bandwidth for VPR can be proven to be a challenging task. However, the effects of JPEG compression on the performance of current VPR techniques have not been previously studied. For this reason, this paper presents an in-depth study of JPEG compression in VPR related scenarios. We use a selection of well-established VPR techniques on well-established benchmark datasets with various amounts of compression applied. We show that by introducing compression, the VPR performance is drastically reduced, especially in the higher spectrum of compression. Moreover, this paper demonstrates how fine-tuning a CNN can be utilised as an optimisation method for JPEG compressed data to perform more consistently with the image transformations detected in extremely JPEG compressed images.

翻译：视觉位置识别(VPR)是一个机器人平台的能力,它能够正确解释其机载相机的视觉刺激,以便确定它目前是否位于先前访问过的地方,尽管观点、照明和外观变化各不相同。 JPEG是一个广泛使用的图像压缩标准,能够以图像清晰度为代价大幅缩小图像的大小。对于同时部署多个机器人平台的应用程序,所收集的视觉数据必须在每个机器人之间远程传输。因此,可以使用JPEG压缩来大幅降低通过通信频道传输的数据数量,因为可以证明与VPR有限的带宽一起工作是一项具有挑战性的任务。然而,JPEG压缩对目前VPR技术的性能的影响以前还没有研究过。为此,本文件对VPR相关情景中JPEG压缩的深度研究。我们选择了一套完善的VPR技术,用于使用各种压缩量的既定基准数据集。我们表明,通过引入压缩,VPR的性能大大降低,特别是在较高的压缩频谱范围内。此外,本文展示了JPEG对当前VPR图像的性能以更精确的方式对GER图像进行升级。</s>

相关内容

声纹识别

关注 444

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日