Image guidance is an effective strategy for depth super-resolution. Generally, most existing methods employ hand-crafted operators to decompose the high-frequency (HF) and low-frequency (LF) ingredients from low-resolution depth maps and guide the HF ingredients by directly concatenating them with image features. However, the hand-designed operators usually cause inferior HF maps (e.g., distorted or structurally missing) due to the diverse appearance of complex depth maps. Moreover, the direct concatenation often results in weak guidance because not all image features have a positive effect on the HF maps. In this paper, we develop a recurrent structure attention guided (RSAG) framework, consisting of two important parts. First, we introduce a deep contrastive network with multi-scale filters for adaptive frequency-domain separation, which adopts contrastive networks from large filters to small ones to calculate the pixel contrasts for adaptive high-quality HF predictions. Second, instead of the coarse concatenation guidance, we propose a recurrent structure attention block, which iteratively utilizes the latest depth estimation and the image features to jointly select clear patterns and boundaries, aiming at providing refined guidance for accurate depth recovery. In addition, we fuse the features of HF maps to enhance the edge structures in the decomposed LF maps. Extensive experiments show that our approach obtains superior performance compared with state-of-the-art depth super-resolution methods.
翻译:一般来说,大多数现有方法都采用手工制作操作器,从低分辨率深度地图中分解高频和低频成分,并通过直接将高频和低频成分与图像特征相融合,引导高频成分。然而,手工设计操作器通常会因复杂深度地图的外观不同而导致低水平高频地图(例如,扭曲或结构缺失)。此外,直接连接往往导致指导不力,因为并非所有图像特征都对高频地图产生积极影响。在本文件中,我们开发了一个经常性结构引导(RSAG)框架,由两个重要部分组成。首先,我们引入了与多尺度过滤器的对比性网络,以适应性频度和基本特征进行直接隔离,将对比性网络从大型过滤器到小型过滤器,以计算适应性高质高频预测的像素对比。第二,我们建议一个经常性结构关注块,利用最新的深度估算和图像特征共同选择清晰的深度模式和图像特征,以便选择更精确的深度模型和深度的深度,目的是在更精确的深度上显示我们改进的深度的深度的模型。