The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition. In this paper, we present a novel deep network to solve the generic rolling shutter correction problem with two consecutive frames. Our pipeline is symmetrically designed to predict the global shutter image corresponding to the intermediate time of these two frames, which is difficult for existing methods because it corresponds to a camera pose that differs most from the two frames. First, two time-symmetric dense undistortion flows are estimated by using well-established principles: pyramidal construction, warping, and cost volume processing. Then, both rolling shutter images are warped into a common global shutter one in the feature space, respectively. Finally, a symmetric consistency constraint is constructed in the image decoder to effectively aggregate the contextual cues of two rolling shutter images, thereby recovering the high-quality global shutter image. Extensive experiments with both synthetic and real data from public benchmarks demonstrate the superiority of our proposed approach over the state-of-the-art methods.
翻译:绝大多数现代消费者级相机都使用滚动百叶窗机制,如果相机在获取图像期间移动,则会导致图像扭曲。 在本文中,我们展示了一个新的深层次网络,用两个连续框架来解决通用滚动百叶窗纠正问题。我们的管道设计成对称,以预测与这两个框架中间时间相对应的全球百叶窗图像,这对于现有方法来说是困难的,因为它与与两个框架最不同的相机相匹配。首先,通过使用既定原则,即金字塔结构、扭曲和成本量处理,对两个时间对称密度密度不扭曲的流动进行了估计。然后,两个滚动百叶窗图像分别被扭曲成一个共同的全球百叶窗。最后,在图像解码器中构建了一个对称一致性的制约,以有效整合两个滚动百叶窗图像的背景线索,从而恢复高质量的全球百叶窗图像。从公共基准中获取的合成数据和真实数据进行的广泛实验,显示了我们所提议的方法优于现状方法。