We present DistillFlow, a knowledge distillation approach to learning optical flow. DistillFlow trains multiple teacher models and a student model, where challenging transformations are applied to the input of the student model to generate hallucinated occlusions as well as less confident predictions. Then, a self-supervised learning framework is constructed: confident predictions from teacher models are served as annotations to guide the student model to learn optical flow for those less confident predictions. The self-supervised learning framework enables us to effectively learn optical flow from unlabeled data, not only for non-occluded pixels, but also for occluded pixels. DistillFlow achieves state-of-the-art unsupervised learning performance on both KITTI and Sintel datasets. Our self-supervised pre-trained model also provides an excellent initialization for supervised fine-tuning, suggesting an alternate training paradigm in contrast to current supervised learning methods that highly rely on pre-training on synthetic data. At the time of writing, our fine-tuned models ranked 1st among all monocular methods on the KITTI 2015 benchmark, and outperform all published methods on the Sintel Final benchmark. More importantly, we demonstrate the generalization capability of DistillFlow in three aspects: framework generalization, correspondence generalization and cross-dataset generalization.
翻译:我们提出了“蒸馏法 ”, 这是一种知识蒸馏法,用于学习光学流。 蒸馏法培训了多种教师模型和学生模型,其中对学生模型的投入应用了具有挑战性的转变,以产生幻觉隔离和不那么自信的预测。 然后,建立了一个自我监督的学习框架:教师模型的自信预测被作为指导学生模型为那些不自信的预测学习光学流的注释。 自我监督的学习框架使我们能够有效地从未标记的数据中学习光学流,不仅针对非隐蔽的像素,而且针对隐蔽的像素。 蒸馏法在KITTI和Sintel数据集中都实现了最先进的不受监督的学习业绩。 我们自我监督的预先培训模型也为监管的微调提供了很好的初始化工具。 与当前高度依赖合成数据培训的受监督的学习方法相比, 在撰写时,我们经过精细调整的模型在KITTI和Sintelvil通用框架的所有单层方法中排名第1位。 展示了2015年总基准和整个SITF标准化, 展示了所有Sintalstal-stalalalalalal化总基准, 在2015年总基准框架中, 演示了所有Sintalmagrodustrismex axxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx