Monocular depth estimation is the base task in computer vision. It has a tremendous development in the decade with the development of deep learning. But the boundary blur of the depth map is still a serious problem. Research finds the boundary blur problem is mainly caused by two factors, first, the low-level features containing boundary and structure information may loss in deeper networks during the convolution process., second, the model ignores the errors introduced by the boundary area due to the few portions of the boundary in the whole areas during the backpropagation. In order to mitigate the boundary blur problem, we focus on the above two impact factors. Firstly, we design a scene understanding module to learn the global information with low- and high-level features, and then to transform the global information to different scales with our proposed scale transform module according to the different phases in the decoder. Secondly, we propose a boundary-aware depth loss function to pay attention to the effects of the boundary's depth value. The extensive experiments show that our method can predict the depth maps with clearer boundaries, and the performance of the depth accuracy base on NYU-depth v2 and SUN RGB-D is competitive.
翻译:单心深度估计是计算机视觉的基础任务。 十年来,随着深层学习的发展,单心深度估计有了巨大的发展。 但是,深度地图的边界模糊仍然是一个严重的问题。 研究发现边界模糊的问题主要有两个因素造成:第一,含有边界和结构信息的低层特征在变迁过程中可能会在更深的网络中丢失。第二,模型忽略了由于边界在后方勘测期间由于整个区域边界的少数部分而带来的错误。为了减轻边界模糊问题,我们把重点放在上述两个影响因素上。第一,我们设计了一个场面理解模块,以学习具有低和高层次特征的全球信息,然后将全球信息转换成不同的尺度,按照我们拟议的规模变形模块在变形过程中的不同阶段进行。第二,我们提议一个边界深度损失功能,以关注边界深度价值的影响。 广泛的实验表明,我们的方法可以以更清楚的边界来预测深度地图,以及NU深度V2和SUN RGB-D的深度精确基础的性能竞争。