There have been many image denoisers using deep neural networks, which outperform conventional model-based methods by large margins. Recently, self-supervised methods have attracted attention because constructing a large real noise dataset for supervised training is an enormous burden. The most representative self-supervised denoisers are based on blind-spot networks, which exclude the receptive field's center pixel. However, excluding any input pixel is abandoning some information, especially when the input pixel at the corresponding output position is excluded. In addition, a standard blind-spot network fails to reduce real camera noise due to the pixel-wise correlation of noise, though it successfully removes independently distributed synthetic noise. Hence, to realize a more practical denoiser, we propose a novel self-supervised training framework that can remove real noise. For this, we derive the theoretic upper bound of a supervised loss where the network is guided by the downsampled blinded output. Also, we design a conditional blind-spot network (C-BSN), which selectively controls the blindness of the network to use the center pixel information. Furthermore, we exploit a random subsampler to decorrelate noise spatially, making the C-BSN free of visual artifacts that were often seen in downsample-based methods. Extensive experiments show that the proposed C-BSN achieves state-of-the-art performance on real-world datasets as a self-supervised denoiser and shows qualitatively pleasing results without any post-processing or refinement.
翻译:近来,使用深度神经网络的图像降噪器已经比传统的基于模型的方法有了更好的表现。由于构建大规模真实的噪声数据集进行监督式训练是一项艰巨的任务,自监督式方法受到了人们的关注。最具代表性的自监督式降噪器是基于盲点网络的,该网络将输入场的中心像素排除在外。然而,对于那些应该在输出位置的对应输入像素被排除的情形排除了一些信息。同时,标准的盲点网络无法消除真实相机噪声,因为噪声的像素相关性。因此,为了实现一种更实用的降噪器,我们提出了一种新的自监督式训练框架,可以消除真实的噪声。为此,我们导出了监督式损失的理论上限,其中网络通过降采样的盲化输出进行了指导。此外,我们设计了一种条件盲点网络(C-BSN),它可以有选择地控制网络的盲区,以使用中心像素信息。另外,我们利用随机子采样器空间上减小噪声的相关性,使得C-BSN不会像基于降采样的方法那样出现视觉伪影。广泛的实验表明,所提出的C-BSN在真实数据集作为自监督式降噪器达到了最先进的性能,并且没有进行任何的后处理或修正即可呈现出优质的结果。