Self-supervised denoising has attracted widespread attention due to its ability to train without clean images. However, noise in real-world scenarios is often spatially correlated, which causes many self-supervised algorithms based on the pixel-wise independent noise assumption to perform poorly on real-world images. Recently, asymmetric pixel-shuffle downsampling (AP) has been proposed to disrupt the spatial correlation of noise. However, downsampling introduces aliasing effects, and the post-processing to eliminate these effects can destroy the spatial structure and high-frequency details of the image, in addition to being time-consuming. In this paper, we systematically analyze downsampling-based methods and propose an Asymmetric Tunable Blind-Spot Network (AT-BSN) to address these issues. We design a blind-spot network with a freely tunable blind-spot size, using a large blind-spot during training to suppress local spatially correlated noise while minimizing damage to the global structure, and a small blind-spot during inference to minimize information loss. Moreover, we propose blind-spot self-ensemble and distillation of non-blind-spot network to further improve performance and reduce computational complexity. Experimental results demonstrate that our method achieves state-of-the-art results while comprehensively outperforming other self-supervised methods in terms of image texture maintaining, parameter count, computation cost, and inference time.
翻译:自控降噪由于训练时无需干净图像,因此备受关注。然而,实际场景中的噪声通常存在空间相关性,导致基于独立像素噪声假设的自控算法在真实图像上表现不佳。最近,非对称像素洗牌下采样(AP)被提出来打破噪声的空间相关性。但是,下采样会引入混叠效应,而为消除这些效应所进行的后处理可能破坏图像的空间结构和高频细节,同时会消耗大量的时间。在本文中,我们系统地分析了基于下采样的方法并提出了一个非对称可调盲区网络(AT-BSN)来解决这些问题。我们设计了一个具有自由可调盲区大小的盲区网络,利用训练期间的大盲区来抑制局部空间相关噪声且最小化对全局结构的破坏,在推理过程中利用小的盲区来尽可能地减少信息损失。此外,我们提出了盲区自集成和非盲区网络的蒸馏以进一步提高性能和减少计算复杂度。实验结果表明,我们的方法在保持图像纹理,参数数量,计算成本和推理时间方面全面优于其他自控方法,并取得了最好的性能。