Dense optical flow estimation is challenging when there are large displacements in a scene with heterogeneous motion dynamics, occlusion, and scene homogeneity. Traditional approaches to handle these challenges include hierarchical and multiresolution processing methods. Learning-based optical flow methods typically use a multiresolution approach with image warping when a broad range of flow velocities and heterogeneous motion is present. Accuracy of such coarse-to-fine methods is affected by the ghosting artifacts when images are warped across multiple resolutions and by the vanishing problem in smaller scene extents with higher motion contrast. Previously, we devised strategies for building compact dense prediction networks guided by the effective receptive field (ERF) characteristics of the network (DDCNet). The DDCNet design was intentionally simple and compact allowing it to be used as a building block for designing more complex yet compact networks. In this work, we extend the DDCNet strategies to handle heterogeneous motion dynamics by cascading DDCNet based sub-nets with decreasing extents of their ERF. Our DDCNet with multiresolution capability (DDCNet-Multires) is compact without any specialized network layers. We evaluate the performance of the DDCNet-Multires network using standard optical flow benchmark datasets. Our experiments demonstrate that DDCNet-Multires improves over the DDCNet-B0 and -B1 and provides optical flow estimates with accuracy comparable to similar lightweight learning-based methods.
翻译:当在具有多种运动动态、封闭性和场景同质性的场景中出现大量偏移时,对光学流进行高密度估计就具有挑战性。处理这些挑战的传统方法包括等级和多分辨率处理方法。学习基础光学流方法通常使用多分辨率方法,当存在广泛的流速和混杂运动时,图像在多分辨率上相互扭曲,以及图像在较小范围的场景中消失问题与运动对比较大时,这种粗略至细微的图案会影响到隐形文物的准确性。以前,我们设计了在网络(DDCNet-Multires)有效可接受场特性指导下建立紧凑密集的预测网络的战略。DDCNet设计有意使用简单和紧凑的方法,作为设计更复杂但又复杂的紧凑的网络。在这项工作中,我们扩大DDCNet战略,通过基于DDCNet的子网变异性动态处理,其ERF规模越来越小。我们具有多分辨率能力的DDCNet(DCNet-Multirels)是紧凑紧的,而没有使用任何专门的网络流中流-MURDMD-BSB的改进数据。我们用标准化网络的模型来评估我们以模拟网络的成绩来进行对比的改进。