The light field (LF) reconstruction is mainly confronted with two challenges, large disparity and the non-Lambertian effect. Typical approaches either address the large disparity challenge using depth estimation followed by view synthesis or eschew explicit depth information to enable non-Lambertian rendering, but rarely solve both challenges in a unified framework. In this paper, we revisit the classic LF rendering framework to address both challenges by incorporating it with advanced deep learning techniques. First, we analytically show that the essential issue behind the large disparity and non-Lambertian challenges is the aliasing problem. Classic LF rendering approaches typically mitigate the aliasing with a reconstruction filter in the Fourier domain, which is, however, intractable to implement within a deep learning pipeline. Instead, we introduce an alternative framework to perform anti-aliasing reconstruction in the image domain and analytically show comparable efficacy on the aliasing issue. To explore the full potential, we then embed the anti-aliasing framework into a deep neural network through the design of an integrated architecture and trainable parameters. The network is trained through end-to-end optimization using a peculiar training set, including regular LFs and unstructured LFs. The proposed deep learning pipeline shows a substantial superiority in solving both the large disparity and the non-Lambertian challenges compared with other state-of-the-art approaches. In addition to the view interpolation for an LF, we also show that the proposed pipeline also benefits light field view extrapolation.
翻译:光场(LF)重建主要面临两大挑战,即巨大的差异和非地中海效应。典型的方法要么利用深度估算解决巨大的差异挑战,然后进行视觉合成,或者回避清晰的深度信息,以便让非地中海人进行翻版,但很少在统一的框架内解决这两个挑战。在本文件中,我们重新审视传统的LF框架框架,通过将它纳入先进的深层次学习技术来应对这两个挑战。首先,我们分析表明,巨大的差异和非地中海挑战背后的重要问题是别名问题。典型的LF方法通常用Fourier域的重建过滤器来缓解别名,而Fourier域的重建过滤器很难在深层次的学习管道内执行。相反,我们引入了在图像域进行反诽谤重建的替代框架,并分析地表上展示了与别名问题相似的相对效力。为了探索全部潜力,我们随后通过设计一个综合的架构和可训练的参数,将反诽谤框架嵌入一个深层次的神经网络。这个网络通过最终到最优化的视野来进行训练,而这个透视则难以在深层次的学习管道内进行执行。我们提出的高层次的视野,同时展示其他高层次的视野,包括常规的LFLF和结构之间的学习。