Global spatial statistics, which are aggregated along entire spatial dimensions, are widely used in top-performance image restorers. For example, mean, variance in Instance Normalization (IN) which is adopted by HINet, and global average pooling (i.e. mean) in Squeeze and Excitation (SE) which is applied to MPRNet. This paper first shows that statistics aggregated on the patches-based/entire-image-based feature in the training/testing phase respectively may distribute very differently and lead to performance degradation in image restorers. It has been widely overlooked by previous works. To solve this issue, we propose a simple approach, Test-time Local Statistics Converter (TLSC), that replaces the region of statistics aggregation operation from global to local, only in the test time. Without retraining or finetuning, our approach significantly improves the image restorer's performance. In particular, by extending SE with TLSC to the state-of-the-art models, MPRNet boost by 0.65 dB in PSNR on GoPro dataset, achieves 33.31 dB, exceeds the previous best result 0.6 dB. In addition, we simply apply TLSC to the high-level vision task, i.e. semantic segmentation, and achieves competitive results. Extensive quantity and quality experiments are conducted to demonstrate TLSC solves the issue with marginal costs while significant gain. The code is available at https://github.com/megvii-research/tlsc.
翻译:在整个空间层面汇总的全球空间统计数据被广泛用于最高性能图像恢复器中,例如,在HINet(HINet)采用的情况正常化差异(IN),以及适用于MPRNet的Squeeze和Expuration(SE)全球平均集合(即平均),本文首先表明,在培训/测试阶段,基于补丁/entire-image(基于补丁/entire-image)的特征的汇总统计数据可能分布非常不同,导致图像恢复器中的性能退化。过去的工作已广泛忽略了这一点。为解决这一问题,我们提议了一个简单的方法,即测试时间本地统计转换器(IN),在测试期间取代全球到地方的统计汇总作业区域(即平均),在不进行再培训或微调的情况下,我们的方法可大大改进图像恢复器的性能。特别是,将基于TLSC(TLSC)的SE)与最新发行模型的MPRNet(0.65 dB) 图像恢复器的性能退化。 PSNSR(GPro dam) 数据集中,实现33-DB,超过以前的边际统计(TSC-SL) 质量部分, 和高额。