通过混合通道的深适应融合进行轻地面重建 (Light Field Reconstruction via Deep Adaptive Fusion of Hybrid Lenses)

This paper explores the problem of reconstructing high-resolution light field (LF) images from hybrid lenses, including a high-resolution camera surrounded by multiple low-resolution cameras. The performance of existing methods is still limited, as they produce either blurry results on plain textured areas or distortions around depth discontinuous boundaries. To tackle this challenge, we propose a novel end-to-end learning-based approach, which can comprehensively utilize the specific characteristics of the input from two complementary and parallel perspectives. Specifically, one module regresses a spatially consistent intermediate estimation by learning a deep multidimensional and cross-domain feature representation, while the other module warps another intermediate estimation, which maintains the high-frequency textures, by propagating the information of the high-resolution view. We finally leverage the advantages of the two intermediate estimations adaptively via the learned attention maps, leading to the final high-resolution LF image with satisfactory results on both plain textured areas and depth discontinuous boundaries. Besides, to promote the effectiveness of our method trained with simulated hybrid data on real hybrid data captured by a hybrid LF imaging system, we carefully design the network architecture and the training strategy. Extensive experiments on both real and simulated hybrid data demonstrate the significant superiority of our approach over state-of-the-art ones. To the best of our knowledge, this is the first end-to-end deep learning method for LF reconstruction from a real hybrid input. We believe our framework could potentially decrease the cost of high-resolution LF data acquisition and benefit LF data storage and transmission.

翻译：本文探讨了从混合镜头中重建高分辨率光场图像的问题,包括由多个低分辨率相机环绕的高分辨率相机,包括由多个低分辨率相机环绕的高分辨率相机。现有方法的性能仍然有限,因为它们在普通纹理区域产生模糊的结果,或者在深度不连续的边界周围产生扭曲。为了应对这一挑战,我们建议采用新的端对端学习方法,该方法可以全面利用从两个互补和平行角度输入的信息的具体特点。具体地说,一个模块通过学习一个深层多维和跨多面特征显示,而另一个模块扭曲了另一个维持高频纹理的中间估计。我们通过传播高分辨率视图的信息,最终利用两种中间估计的优势。我们通过学习的注意地图,导致最终高分辨率LF图像,同时在纯文本和深度不连续的边界上都取得令人满意的结果。此外,我们通过模拟混合LF成像系统采集真实混合数据,我们仔细设计了一个维持高频纹纹纹纹纹纹纹纹质的中间估计。我们仔细设计了网络结构的模型的模型,并且对低度数据进行了深入的学习,对低度数据进行了测试。