State-of-the-art methods for optical flow estimation rely on deep learning, which require complex sequential training schemes to reach optimal performances on real-world data. In this work, we introduce the COMBO deep network that explicitly exploits the brightness constancy (BC) model used in traditional methods. Since BC is an approximate physical model violated in several situations, we propose to train a physically-constrained network complemented with a data-driven network. We introduce a unique and meaningful flow decomposition between the physical prior and the data-driven complement, including an uncertainty quantification of the BC model. We derive a joint training scheme for learning the different components of the decomposition ensuring an optimal cooperation, in a supervised but also in a semi-supervised context. Experiments show that COMBO can improve performances over state-of-the-art supervised networks, e.g. RAFT, reaching state-of-the-art results on several benchmarks. We highlight how COMBO can leverage the BC model and adapt to its limitations. Finally, we show that our semi-supervised method can significantly simplify the training procedure.
翻译:光流估计的最新方法依赖于深层次的学习,这需要复杂的连续培训计划,才能在真实世界数据上达到最佳性能。在这项工作中,我们引入了COMBO深度网络,明确利用传统方法中使用的亮度凝聚模型。由于BC是一种在几种情况下被违反的近似物理模型,我们建议培训一个物质上封闭的网络,辅之以数据驱动网络。我们引入了一种独特的和有意义的流量分解,在物理前和数据驱动的补充之间,包括不列颠哥伦比亚模型的不确定性量化。我们获得了一个联合培训计划,以学习分解的不同组成部分,确保一种最佳合作,在受监督但也是在半受监督的环境中。实验表明COMBO可以改进最先进的网络的性能,例如RAFT,在几个基准上达到最新的结果。我们强调COMBO如何利用B模型并适应其局限性。最后,我们表明,我们的半监督方法可以大大简化培训程序。