This paper proposes a new depth completion method based on multi-view improved monitored distillation to generate more accurate depth maps. Based on the state-of-the-art depth completion method named ensemble distillation, we introduce an existing stereo-based model as a teacher model to improve ensemble distillation accuracy and generate a more accurate student model in training by avoiding inherent error modes of completion-based teachers as well as minimizing the reconstruction error for a given image. We also leverage multi-view depth consistency and multi-scale minimum reprojection to provide self-supervised information. These methods use the existing structure constraints to yield supervised signals for student model training without great expense on gathering ground truth information of depth. Our extensive experimental evaluation demonstrates that our proposed method can effectively improve the accuracy of baseline method of monitored distillation.
翻译:本文提出了一种基于多视角改进监督蒸馏的新型深度完形填空方法,以生成更准确的深度图。在基于集成蒸馏的最新深度完形填空方法的基础上,我们引入了一个现有的基于立体视觉的模型作为教师模型,以提高集成蒸馏的准确性,并通过避免基于完形填空教师的固有错误模式以及在给定图像的情况下最小化重构误差来生成一个更准确的学生模型。我们还利用多视角深度一致性和多尺度最小投影来提供自我监督信息。这些方法利用现有的结构约束,为学生模型的训练提供了监督信号,而无需花费大量精力收集深度的真实信息。我们的广泛实验评估表明,我们提出的方法可以有效地提高监督蒸馏基线方法的准确性。