Removing haze from real-world images is challenging due to unpredictable weather conditions, resulting in misaligned hazy and clear image pairs. In this paper, we propose a non-aligned supervision framework that consists of three networks - dehazing, airlight, and transmission. In particular, we explore a non-alignment setting by utilizing a clear reference image that is not aligned with the hazy input image to supervise the dehazing network through a multi-scale reference loss that compares the features of the two images. Our setting makes it easier to collect hazy/clear image pairs in real-world environments, even under conditions of misalignment and shift views. To demonstrate this, we have created a new hazy dataset called "Phone-Hazy", which was captured using mobile phones in both rural and urban areas. Additionally, we present a mean and variance self-attention network to model the infinite airlight using dark channel prior as position guidance, and employ a channel attention network to estimate the three-channel transmission. Experimental results show that our framework outperforms current state-of-the-art methods in the real-world image dehazing. Phone-Hazy and code will be available at https://github.com/hello2377/NSDNet.
翻译:在本文中,我们提议了一个不结盟监督框架,由三个网络组成,即脱光、空气光和传输。特别是,我们探索了一种不匹配的设置。我们利用一个与隐暗输入图像不匹配的清晰参考图像来探索一个不结盟的设置,以通过多尺度的参考损失来监督脱色网络,将两种图像的特征进行比较。我们的设置使得在真实世界环境中,甚至在不匹配和变换观点的条件下,更容易收集模糊/清晰的图像配对。为了证明这一点,我们创建了一个称为“Phone-Hazy”的新的“Hazy”数据组,在农村和城市地区都使用了移动电话。此外,我们展示了一个与隐暗输入图像不匹配的隐含和差异的自我保护网络,用暗色频道作为定位指导,并使用一个频道关注网络来估计三声道的传输。实验结果表明,我们的框架在现实世界图像中超越了当前状态的NS23/DcommazingHe/MAL-DHASY 代码。</s>