Taking photographs ''in-the-wild'' is often hindered by fence obstructions that stand between the camera user and the scene of interest, and which are hard or impossible to avoid. De-fencing is the algorithmic process of automatically removing such obstructions from images, revealing the invisible parts of the scene. While this problem can be formulated as a combination of fence segmentation and image inpainting, this often leads to implausible hallucinations of the occluded regions. Existing multi-frame approaches rely on propagating information to a selected keyframe from its temporal neighbors, but they are often inefficient and struggle with alignment of severely obstructed images. In this work we draw inspiration from the video completion literature and develop a simplified framework for multi-frame de-fencing that computes high quality flow maps directly from obstructed frames and uses them to accurately align frames. Our primary focus is efficiency and practicality in a real-world setting: the input to our algorithm is a short image burst (5 frames) - a data modality commonly available in modern smartphones - and the output is a single reconstructed keyframe, with the fence removed. Our approach leverages simple yet effective CNN modules, trained on carefully generated synthetic data, and outperforms more complicated alternatives real bursts, both quantitatively and qualitatively, while running real-time.
翻译:拍摄“ 边缘” 照片往往受到屏障障碍的阻碍,这些障碍存在于相机用户和感兴趣的场景之间,而且很难避免,也不可能避免。 屏障是自动从图像中清除这些障碍的算法过程,揭示了现场的隐形部分。 虽然这个问题可以作为栅栏分割和图像涂漆的结合而形成,但往往导致对隐蔽区域产生难以置信的幻觉。 现有的多框架方法依靠的是将信息传播到一个选定的关键框架,而这种关键框架是其时相邻的,但它们往往效率低下,而且与严重受阻的图像的对齐纠缠不休。 在这项工作中,我们从视频完成文献中汲取灵感,并开发一个简化的多框架脱钩框架框架框架框架框架框架,直接从被阻断的框架中绘制高质量的流图,并用它们来准确校准框架。 我们的主要重点是现实环境中环境中环境中的效率和实用性: 我们的算法输入是短图像爆(5个框架)—— 现代智能手机中常见的一种数据模式,产出是单一的经过仔细重建的钥匙框架,并删除了栅栏。 我们的方法是经过了真正的、 复杂、 工具的、 。