Single image desnowing is a common yet challenging task. The complex snow degradations and diverse degradation scales demand strong representation ability. In order for the desnowing network to see various snow degradations and model the context interaction of local details and global information, we propose a powerful architecture dubbed as SnowFormer. First, it performs Scale-aware Feature Aggregation in the encoder to capture rich snow information of various degradations. Second, in order to tackle with large-scale degradation, it uses a novel Context Interaction Transformer Block in the decoder, which conducts context interaction of local details and global information from previous scale-aware feature aggregation in global context interaction. And the introduction of local context interaction improves recovery of scene details. Third, we devise a Heterogeneous Feature Projection Head which progressively fuse features from both the encoder and decoder and project the refined feature into the clean image. Extensive experiments demonstrate that the proposed SnowFormer achieves significant improvements over other SOTA methods. Compared with SOTA single image desnowing method HDCW-Net, it boosts the PSNR metric by 9.2dB on the CSD testset. Moreover, it also achieves a 5.13dB increase in PSNR compared with general image restoration architecture NAFNet, which verifies the strong representation ability of our SnowFormer for snow removal task. The code is released in \url{https://github.com/Ephemeral182/SnowFormer}.
翻译:单一图像的淡化是一项共同但具有挑战性的任务。复杂的降雪退化和不同的降解规模要求强大的代表能力。为了让降雪网络看到各种降雪的退化,并模拟当地细节和全球信息的背景互动。为了让降雪网络看到各种降雪的退化和模拟当地细节和全球信息的背景互动,我们建议了一个称为“Snow Former”的强大架构。首先,它在编码器中安装了比例觉觉的特征聚合,以捕捉各种退化的丰富雪信息。第二,为了应对大规模退化,它使用一个新型的环境互动变异块在拆解器中,它使用一个新的环境变异器。在拆解器中,它进行地方细节和以前规模变异功能组合的全球信息的背景互动。为了让当地环境互动能够改善现场细节的恢复。第三,我们设计了一个超强的地变异功能投影头,把精细的功能与各种变形的变形图像投影集集集到Scial-Stual-Net上,它能提升了PSNRIS/Stual的图像变影化能力。