In video denoising, the adjacent frames often provide very useful information, but accurate alignment is needed before such information can be harnassed. In this work, we present a multi-alignment network, which generates multiple flow proposals followed by attention-based averaging. It serves to mimics the non-local mechanism, suppressing noise by averaging multiple observations. Our approach can be applied to various state-of-the-art models that are based on flow estimation. Experiments on a large-scale video dataset demonstrate that our method improves the denoising baseline model by 0.2dB, and further reduces the parameters by 47% with model distillation.
翻译:在视频拆解中,相邻框架通常提供非常有用的信息,但在这类信息被撕裂之前需要准确的对齐。 在这项工作中,我们展示了一个多匹配网络,产生多个流量建议,然后以平均关注为基础。它可以模仿非本地机制,通过平均多次观测抑制噪音。我们的方法可以适用于基于流量估计的各种最先进的模型。 大规模视频数据集实验表明,我们的方法用0.2dB改进了拆开基线模型,用模型蒸馏进一步将参数减少47%。