Photometric consistency loss is one of the representative objective functions commonly used for self-supervised monocular depth estimation. However, this loss often causes unstable depth predictions in textureless or occluded regions due to incorrect guidance. Recent self-supervised learning approaches tackle this issue by utilizing feature representations explicitly learned from auto-encoders, expecting better discriminability than the input image. Despite the use of auto-encoded features, we observe that the method does not embed features as discriminative as auto-encoded features. In this paper, we propose residual guidance loss that enables the depth estimation network to embed the discriminative feature by transferring the discriminability of auto-encoded features. We conducted experiments on the KITTI benchmark and verified our method's superiority and orthogonality on other state-of-the-art methods.
翻译:光度一致性损失是自我监督单层深度估计通常使用的代表性客观功能之一,但这一损失往往由于指导不正确而在无纹理或隐蔽区域造成不稳的深度预测。最近自我监督的学习方法通过利用从自动编码器中明确学到的特征表征来解决这一问题,期望比输入图像更具有差异性。尽管使用自动编码的特征,但我们认为该方法并未包含像自动编码特征那样具有歧视性的特征。在本文中,我们提议了剩余指导损失,使深度估计网络能够通过转移自动编码特征的可分性来嵌入歧视特征。我们进行了关于KITTI基准的实验,并核实了我们方法的优越性和与其他最新方法的异性。