This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC). This edition was open to methods using any form of supervision, including fully-supervised, self-supervised, multi-task or proxy depth. The challenge was based around the SYNS-Patches dataset, which features a wide diversity of environments with high-quality dense ground-truth. This includes complex natural environments, e.g. forests or fields, which are greatly underrepresented in current benchmarks. The challenge received eight unique submissions that outperformed the provided SotA baseline on any of the pointcloud- or image-based metrics. The top supervised submission improved relative F-Score by 27.62%, while the top self-supervised improved it by 16.61%. Supervised submissions generally leveraged large collections of datasets to improve data diversity. Self-supervised submissions instead updated the network architecture and pretrained backbones. These results represent a significant progress in the field, while highlighting avenues for future research, such as reducing interpolation artifacts at depth boundaries, improving self-supervised indoor performance and overall natural image accuracy.
翻译:本文讨论了单目深度估计挑战赛 (MDEC) 的第二版结果。本版挑战赛对使用任何形式的监督方式进行估计的方法开放,包括全监督、自监督、多任务或代理深度。挑战赛基于SYNS-Patches数据集,该数据集提供了高质量稠密的地面真实数据,具有广泛的环境多样性,包括目前基准测试中很少见的复杂自然环境,例如森林或田野。挑战共收到了八个独特的提交结果,它们在任何点云或基于图像的评估指标上都优于提供的SotA基准线。最佳的全监督提交结果将相对F-Score提高了27.62%,而最佳的自监督提交将其提高了16.61%。全监督的提交结果通常利用大量的数据集收集来提高数据多样性。而自监督的提交结果则更新了网络架构和预训练的他们。这些结果代表了该领域的重大进展,同时也强调了未来研究的方向,如减少深度边界的插值伪影、提高自监督室内性能和综合自然图像的准确性。