Visual Place Recognition (VPR) is the ability of a robotic platform to correctly interpret visual stimuli from its on-board cameras in order to determine whether it is currently located in a previously visited place, despite different viewpoint, illumination and appearance changes. JPEG is a widely used image compression standard that is capable of significantly reducing the size of an image at the cost of image clarity. For applications where several robotic platforms are simultaneously deployed, the visual data gathered must be transmitted remotely between each robot. Hence, JPEG compression can be employed to drastically reduce the amount of data transmitted over a communication channel, as working with limited bandwidth for VPR can be proven to be a challenging task. However, the effects of JPEG compression on the performance of current VPR techniques have not been previously studied. For this reason, this paper presents an in-depth study of JPEG compression in VPR related scenarios. We use a selection of well-established VPR techniques on well-established benchmark datasets with various amounts of compression applied. We show that by introducing compression, the VPR performance is drastically reduced, especially in the higher spectrum of compression. Moreover, this paper demonstrates how fine-tuning a CNN can be utilised as an optimisation method for JPEG compressed data to perform more consistently with the image transformations detected in extremely JPEG compressed images.
翻译:视觉位置识别(VPR)是一个机器人平台的能力,它能够正确解释其机载相机的视觉刺激,以便确定它目前是否位于先前访问过的地方,尽管观点、照明和外观变化各不相同。 JPEG是一个广泛使用的图像压缩标准,能够以图像清晰度为代价大幅缩小图像的大小。对于同时部署多个机器人平台的应用程序,所收集的视觉数据必须在每个机器人之间远程传输。因此,可以使用JPEG压缩来大幅降低通过通信频道传输的数据数量,因为可以证明与VPR有限的带宽一起工作是一项具有挑战性的任务。然而,JPEG压缩对目前VPR技术的性能的影响以前还没有研究过。为此,本文件对VPR相关情景中JPEG压缩的深度研究。我们选择了一套完善的VPR技术,用于使用各种压缩量的既定基准数据集。我们表明,通过引入压缩,VPR的性能大大降低,特别是在较高的压缩频谱范围内。此外,本文展示了JPEG对当前VPR图像的性能以更精确的方式对GER图像进行升级。</s>