Aiming to restore the original intensity of shadow regions in an image and make them compatible with the remaining non-shadow regions without a trace, shadow removal is a very challenging problem that benefits many downstream image/video-related tasks. Recently, transformers have shown their strong capability in various applications by capturing global pixel interactions and this capability is highly desirable in shadow removal. However, applying transformers to promote shadow removal is non-trivial for the following two reasons: 1) The patchify operation is not suitable for shadow removal due to irregular shadow shapes; 2) shadow removal only needs one-way interaction from the non-shadow region to the shadow region instead of the common two-way interactions among all pixels in the image. In this paper, we propose a novel cross-region transformer, namely CRFormer, for shadow removal which differs from existing transformers by only considering the pixel interactions from the non-shadow region to the shadow region without splitting images into patches. This is achieved by a carefully designed region-aware cross-attention operation that can aggregate the recovered shadow region features conditioned on the non-shadow region features. Extensive experiments on ISTD, AISTD, SRD, and Video Shadow Removal datasets demonstrate the superiority of our method compared to other state-of-the-art methods.
翻译:为了在图像中恢复阴影区域的原始强度,使其与其余的非阴影区域不留痕迹地相容,阴影清除是一个非常棘手的问题,有利于许多下游图像/视频相关任务。最近,变压器通过捕捉全球像素相互作用,在各种应用中表现出强大的能力,这种能力在阴影清除中非常可取。然而,应用变压器促进阴影清除是非三角的,原因如下:1)由于阴影形状不规则,固定操作不适于清除阴影;2)光影清除只需要从非阴影区域到阴影区域的单向互动,而不是图像中所有像素之间的共同双向互动。在本文件中,我们提议采用新的跨区域变压器,即Cremonormer,用于消除阴影,因为仅考虑从非阴影区域到阴影区域的象素互动,而不将图像分解成补。这是通过精心设计的区域觉交叉保护操作实现的,它可以将非阴影区域特性的恢复的阴影区域合并为阴影区域,而不是图像区域所有象素之间的双向互动。在本文中,我们提出了一个新的跨区域变压变换器,即 Corormerormermermermermermermermermer,我们将STD数据。