This paper presents an open and comprehensive framework to systematically evaluate state-of-the-art contributions to self-supervised monocular depth estimation. This includes pretraining, backbone, architectural design choices and loss functions. Many papers in this field claim novelty in either architecture design or loss formulation. However, simply updating the backbone of historical systems results in relative improvements of 25%, allowing them to outperform the majority of existing systems. A systematic evaluation of papers in this field was not straightforward. The need to compare like-with-like in previous papers means that longstanding errors in the evaluation protocol are ubiquitous in the field. It is likely that many papers were not only optimized for particular datasets, but also for errors in the data and evaluation criteria. To aid future research in this area, we release a modular codebase (https://github.com/jspenmar/monodepth_benchmark), allowing for easy evaluation of alternate design decisions against corrected data and evaluation criteria. We re-implement, validate and re-evaluate 16 state-of-the-art contributions and introduce a new dataset (SYNS-Patches) containing dense outdoor depth maps in a variety of both natural and urban scenes. This allows for the computation of informative metrics in complex regions such as depth boundaries.
翻译:本文提供了一个开放和全面的框架,以系统评价对自我监督的单体深度估计的最新贡献,其中包括培训前、骨干、建筑设计选择和损失功能。许多这一领域的论文都声称建筑设计或损失配置方面的新颖性。然而,只要更新历史系统的主干系统,就相对改进25%,使它们优于现有系统。对该领域的文件进行系统化评价并非直截了当。需要与以往文件的类似之处进行比较,这意味着评价协议的长期错误在实地普遍存在。许多论文可能不仅优化特定数据集,而且还有数据和评价标准方面的错误。为了协助今后在这一领域的研究,我们发布了一个模块代码库(https://github.com/jspenmar/mono depload_benkmark),以便根据纠正的数据和评价标准,方便地评估替代设计决定。我们重新实施、验证和重新评价了16项最新贡献,并引入了一个新的数据数据集(SYNS-Pachels),不仅用于特定数据集,而且用于数据和评价标准中的错误。为了帮助今后在这一领域的研究,我们发布一个模块(http系统-paches),可以对城市深度的深度进行深密测。