Depth map super-resolution is a task with high practical application requirements in the industry. Existing color-guided depth map super-resolution methods usually necessitate an extra branch to extract high-frequency detail information from RGB image to guide the low-resolution depth map reconstruction. However, because there are still some differences between the two modalities, direct information transmission in the feature dimension or edge map dimension cannot achieve satisfactory result, and may even trigger texture copying in areas where the structures of the RGB-D pair are inconsistent. Inspired by the multi-task learning, we propose a joint learning network of depth map super-resolution (DSR) and monocular depth estimation (MDE) without introducing additional supervision labels. For the interaction of two subnetworks, we adopt a differentiated guidance strategy and design two bridges correspondingly. One is the high-frequency attention bridge (HABdg) designed for the feature encoding process, which learns the high-frequency information of the MDE task to guide the DSR task. The other is the content guidance bridge (CGBdg) designed for the depth map reconstruction process, which provides the content guidance learned from DSR task for MDE task. The entire network architecture is highly portable and can provide a paradigm for associating the DSR and MDE tasks. Extensive experiments on benchmark datasets demonstrate that our method achieves competitive performance. Our code and models are available at https://rmcong.github.io/proj_BridgeNet.html.
翻译:深度图的超级分辨率是该行业中具有高实际应用要求的任务。现有的色导深度图超级分辨率方法通常需要额外的分支,从 RGB 图像中提取高频详细信息,以指导低分辨率深度地图重建。然而,由于两种模式之间仍有一些差异,在特征维度或边缘地图维度方面直接信息传输不能取得令人满意的结果,甚至可能触发在RGB-D对结构不一致的地区复制RGB-D对结构结构不协调的纹理。在多任务学习的启发下,我们提议建立一个由深度图超分辨率和单眼深度估计(MDE)组成的联合学习网络,而不引入额外的监督标签。对于两个子网络的相互作用,我们采取了有区别的指导战略,并相应设计了两座桥梁。一个是高频关注桥(HABdg),为特征编码进程设计,学习MDE对DSR任务的高频率信息。另一个是内容指导桥(CGBgg),为深度地图重建进程设计,提供从DSR-ROB 竞争性模型中学习的内容指导,为MDE 任务提供我们高标准化的DRE 基准 任务提供我们的数据模型。