Global operations, such as global average pooling, are widely used in top-performance image restorers. They aggregate global information from input features along entire spatial dimensions but behave differently during training and inference in image restoration tasks: they are based on different regions, namely the cropped patches (from images) and the full-resolution images. This paper revisits global information aggregation and finds that the image-based features during inference have a different distribution than the patch-based features during training. This train-test inconsistency negatively impacts the performance of models, which is severely overlooked by previous works. To reduce the inconsistency and improve test-time performance, we propose a simple method called Test-time Local Converter (TLC). Our TLC converts global operations to local ones only during inference so that they aggregate features within local spatial regions rather than the entire large images. The proposed method can be applied to various global modules (e.g., normalization, channel and spatial attention) with negligible costs. Without the need for any fine-tuning, TLC improves state-of-the-art results on several image restoration tasks, including single-image motion deblurring, video deblurring, defocus deblurring, and image denoising. In particular, with TLC, our Restormer-Local improves the state-of-the-art result in single image deblurring from 32.92 dB to 33.57 dB on GoPro dataset. The code is available at https://github.com/megvii-research/tlc.
翻译:全球平均共享等全球操作被广泛用于顶级性能图像恢复器中,它们汇集了来自整个空间层面输入特征的全球信息,但在培训和推断图像恢复任务中表现不同:它们基于不同区域,即裁剪补丁(来自图像)和完整分辨率图像。本文回顾了全球信息汇总,发现推断过程中的图像特征的分布与培训过程中的补丁特征不同。这种测试不一致性对模型的性能产生了负面影响,而以往的工作严重忽略了这一点。为减少不一致性并改进测试时间性能,我们提议了一个简单的方法,称为Test-time 本地转换器(TLC)。我们的TLC仅在推断期间将全球业务转换为当地业务,以便将其组合成当地空间区域而不是整个大图像的特征。拟议方法可以以微不足道的成本应用于各种全球模块(例如,正常化,频道和空间关注)。不需要任何微调,TLC改进以往工作严重忽略的模型的状态。为了减少不一致,我们的一些图像恢复任务,包括单image-timeal Prour delning deblus deglus deglus, exliver degilling degal degal damalalal laus.