Optical flow is inherently a 2D search problem, and thus the computational complexity grows quadratically with respect to the search window, making large displacements matching infeasible for high-resolution images. In this paper, we propose a new method for high-resolution optical flow estimation with significantly less computation, which is achieved by factorizing 2D optical flow with 1D attention and correlation. Specifically, we first perform a 1D attention operation in the vertical direction of the target image, and then a simple 1D correlation in the horizontal direction of the attended image can achieve 2D correspondence modeling effect. The directions of attention and correlation can also be exchanged, resulting in two 3D cost volumes that are concatenated for optical flow estimation. The novel 1D formulation empowers our method to scale to very high-resolution input images while maintaining competitive performance. Extensive experiments on Sintel, KITTI and real-world 4K ($2160 \times 3840$) resolution images demonstrated the effectiveness and superiority of our proposed method.
翻译:光学流本质上是一个2D搜索问题,因此,计算的复杂性在搜索窗口方面以四倍的方式增长,使大量偏移与高分辨率图像相匹配。在本文中,我们提出了一种高分辨率光学流估计的新方法,其计算量要少得多,其实现方法是将2D光学流与1D的注意和相关性相乘。具体地说,我们首先在目标图像的垂直方向上进行1D的注意操作,然后在所观看图像的横向方向上进行简单的1D相关操作,就可以产生2D对应的建模效果。关注方向和相关性也可以相互交换,从而产生两卷3D的成本,用于光学流估计。新的1D公式使我们有能力在保持竞争性性能的同时将非常高分辨率的输入图像缩放。在Sintel、KITTI和现实世界4K上的广泛实验(2160 time 3840$)的分辨率图像证明了我们拟议方法的有效性和优越性。