Video deblurring is still an unsolved problem due to the challenging spatio-temporal modeling process. While existing convolutional neural network-based methods show a limited capacity for effective spatial and temporal modeling for video deblurring. This paper presents VDTR, an effective Transformer-based model that makes the first attempt to adapt Transformer for video deblurring. VDTR exploits the superior long-range and relation modeling capabilities of Transformer for both spatial and temporal modeling. However, it is challenging to design an appropriate Transformer-based model for video deblurring due to the complicated non-uniform blurs, misalignment across multiple frames and the high computational costs for high-resolution spatial modeling. To address these problems, VDTR advocates performing attention within non-overlapping windows and exploiting the hierarchical structure for long-range dependencies modeling. For frame-level spatial modeling, we propose an encoder-decoder Transformer that utilizes multi-scale features for deblurring. For multi-frame temporal modeling, we adapt Transformer to fuse multiple spatial features efficiently. Compared with CNN-based methods, the proposed method achieves highly competitive results on both synthetic and real-world video deblurring benchmarks, including DVD, GOPRO, REDS and BSD. We hope such a Transformer-based architecture can serve as a powerful alternative baseline for video deblurring and other video restoration tasks. The source code will be available at \url{https://github.com/ljzycmd/VDTR}.
翻译:由于具有挑战性的片段时空建模过程,视频破碎是一个尚未解决的问题。虽然基于当前神经神经网络的现有方法显示,为视频破碎进行有效空间和时间建模的有限空间和时间建模能力。本文展示了VDTR, 这是一种有效的变压器型模型,首次尝试将变压器改造为视频破碎。 VDTR利用变压器的高级长程和关系建模能力进行空间和时空建模。然而,设计一个基于变压器的视频破碎设计模型却具有挑战性。由于复杂的非统一平面平面平面平面模糊、多框架错配以及高分辨率空间建模计算成本高。为了解决这些问题,VDTR倡导在非重叠窗口内进行关注,并利用远程依赖型建模的等级结构。关于框架级空间建模,我们提议一个基于变压器的变压变压变压器,利用多尺度的变压模型的变压机变压模型。对于多框架的变压器-变压器-变压机-变压机-变压机-变压机-变压机的变压机-变压机-变压机-变压机-变换的变压机-变压机-变压-变压机-变压-变压机-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变制-变压-变压-变压-变压-变压-变压-变压-变制-变制-变制-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-变压-制-变压-