Image smoothing is a fundamental low-level vision task that aims to preserve salient structures of an image while removing insignificant details. Deep learning has been explored in image smoothing to deal with the complex entanglement of semantic structures and trivial details. However, current methods neglect two important facts in smoothing: 1) naive pixel-level regression supervised by the limited number of high-quality smoothing ground-truth could lead to domain shift and cause generalization problems towards real-world images; 2) texture appearance is closely related to object semantics, so that image smoothing requires awareness of semantic difference to apply adaptive smoothing strengths. To address these issues, we propose a novel Contrastive Semantic-Guided Image Smoothing Network (CSGIS-Net) that combines both contrastive prior and semantic prior to facilitate robust image smoothing. The supervision signal is augmented by leveraging undesired smoothing effects as negative teachers, and by incorporating segmentation tasks to encourage semantic distinctiveness. To realize the proposed network, we also enrich the original VOC dataset with texture enhancement and smoothing labels, namely VOC-smooth, which first bridges image smoothing and semantic segmentation. Extensive experiments demonstrate that the proposed CSGIS-Net outperforms state-of-the-art algorithms by a large margin. Code and dataset are available at https://github.com/wangjie6866/CSGIS-Net.
翻译:图像平滑是一个基本的低层次的愿景任务,目的是保存图像的突出结构,同时删除微小细节。在图像平滑过程中探索了深层次的学习,以应对语义结构的复杂纠缠和细小细节。然而,目前的方法忽略了平滑过程中的两个重要事实:(1) 由数量有限的高质量平滑地面图解所监督的天真的像素级回归可能导致域变换,并导致对真实世界图像的普遍化问题;(2) 纹理外观与对象语义学密切相关,因此图像平滑需要对语义差异的认识,以适应性平滑的力量。为了解决这些问题,我们提议建立一个创新的对比性语义-指导图像平滑网络(CSGIS-Net)网络平滑网络(CSGIS-Net)网络,将之前的对比性和语义性结合起来,以促进稳妥的图像平滑的平滑性平滑性平滑性平滑性平滑性平滑性。通过利用不平滑性平滑性平滑性平滑性平滑性平滑性平滑性平滑性平滑性平流图像平流/网络平流/平滑性平滑性平流图像平流层平流层平流层平流层平流层平流图像的平流图图解图解图解图解图图解图解图解图解图解图解图解图解图解图解图解图解图,即S-S-S-S-S-S-S-S-S-S-Sl平流层图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图,以展示图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解图解。