Unsupervised optical flow estimation is especially hard near occlusions and motion boundaries and in low-texture regions. We show that additional information such as semantics and domain knowledge can help better constrain this problem. We introduce SemARFlow, an unsupervised optical flow network designed for autonomous driving data that takes estimated semantic segmentation masks as additional inputs. This additional information is injected into the encoder and into a learned upsampler that refines the flow output. In addition, a simple yet effective semantic augmentation module provides self-supervision when learning flow and its boundaries for vehicles, poles, and sky. Together, these injections of semantic information improve the KITTI-2015 optical flow test error rate from 11.80% to 8.38%. We also show visible improvements around object boundaries as well as a greater ability to generalize across datasets. Code will be made available.
翻译:无监督的光学流量估计在封闭和运动边界附近以及在低文本区域特别困难。 我们显示, 语义学和域知识等额外信息有助于更好地遏制这一问题。 我们引入了SemARFlow, 这是一个不受监督的光流网络, 用于自动驱动数据, 将估计的语义分解面遮罩作为额外输入。 额外信息被注入到编码器和精炼流输出的高级智能采集器中。 此外, 一个简单而有效的语义增强模块在学习车辆、 杆和天空的流动和界限时提供自我监督的视觉。 这些语义信息注入将KITTI-2015光流测试误率从11.80%提高到8.38%。 我们还展示了在物体边界周围可见的改进, 以及跨数据集的更普及能力。 代码将会被提供 。</s>