Global Positioning Systems (GPS) have played a crucial role in various navigation applications. Nevertheless, localizing the perfect destination within the last few meters remains an important but unresolved problem. Limited by the GPS positioning accuracy, navigation systems always show users a vicinity of a destination, but not its exact location. Street view images (SVI) in maps as an immersive media technology have served as an aid to provide the physical environment for human last-meters wayfinding. However, due to the large diversity of geographic context and acquisition conditions, the captured SVI always contains various distracting objects (e.g., pedestrians and vehicles), which will distract human visual attention from efficiently finding the destination in the last few meters. To address this problem, we highlight the importance of reducing visual distraction in image-based wayfinding by proposing a saliency-guided image inpainting framework. It aims at redirecting human visual attention from distracting objects to destination-related objects for more efficient and accurate wayfinding in the last meters. Specifically, a context-aware distracting object detection method driven by deep salient object detection has been designed to extract distracting objects from three semantic levels in SVI. Then we employ a large-mask inpainting method with fast Fourier convolutions to remove the detected distracting objects. Experimental results with both qualitative and quantitative analysis show that our saliency-guided inpainting method can not only achieve great perceptual quality in street view images but also redirect the human's visual attention to focus more on static location-related objects than distracting ones. The human-based evaluation also justified the effectiveness of our method in improving the efficiency of locating the target destination.
翻译:全球定位系统(GPS)在各种导航应用中发挥了关键作用。然而,将最后几米内的完美目的地本地化仍然是一个重要但尚未解决的问题。由于全球定位系统定位精确度的限制,导航系统总是将用户显示在一个目的地附近,而不是其确切位置。地图中的街景图像(SVI)作为一种隐蔽媒体技术,有助于为人类最后几米的探测提供物理环境。然而,由于地理背景和获取条件差异很大,所捕捉的SVI总是包含各种转移方向的物体(例如行人和车辆),这将转移人类视觉注意力,使其无法有效地找到最后几米的目的地。为了解决这一问题,我们强调减少基于图像的探测方式的视觉分散的重要性,方法是提出一个显性、导导图象的图象框架。目的是将人类视觉注意力从转移到与目的地有关的物体,以便在最后几米进行更高效和准确的勘测。具体地说,由深度物体探测所驱动的、但有背景觉察力分散的物体探测方法,将转移人类视觉注意力,而不是在最后几米中有效查找目标。为了在快速的目的地分析中,我们用高级方向分析中,用快速方向分析,也用直径分析方法来将人类的轨道定位定位定位定位,从而推移移动。