In this paper, we present a novel and effective framework, named 4K-NeRF, to pursue high fidelity view synthesis on the challenging scenarios of ultra high resolutions, building on the methodology of neural radiance fields (NeRF). The rendering procedure of NeRF-based methods typically relies on a pixel wise manner in which rays (or pixels) are treated independently on both training and inference phases, limiting its representational ability on describing subtle details especially when lifting to a extremely high resolution. We address the issue by better exploring ray correlation for enhancing high-frequency details benefiting from the use of geometry-aware local context. Particularly, we use the view-consistent encoder to model geometric information effectively in a lower resolution space and recover fine details through the view-consistent decoder, conditioned on ray features and depths estimated by the encoder. Joint training with patch-based sampling further facilitates our method incorporating the supervision from perception oriented regularization beyond pixel wise loss. Quantitative and qualitative comparisons with modern NeRF methods demonstrate that our method can significantly boost rendering quality for retaining high-frequency details, achieving the state-of-the-art visual quality on 4K ultra-high-resolution scenario. Code Available at \url{https://github.com/frozoul/4K-NeRF}
翻译:在本文中,我们提出了一个名为4K-NERF的新而有效的框架,以利用神经光谱场的方法(NERF),对超高分辨率的具有挑战性的情景进行高度忠诚的合成。 NERF方法的形成程序通常依靠像素智慧的方式,在培训和推断两个阶段独立处理射线(或像素),限制其描述微妙细节的表达能力,特别是在向极高的分辨率提升时;我们通过更好地探索光谱相关关系来解决这一问题,以加强高频细节,从而从使用几何能观测当地环境中受益。特别是,我们利用视觉一致的编码器,在较低分辨率空间中有效地建模几何信息,并通过视相一致的解码器恢复细微细节,以光谱特征和深度为条件,对光谱采样进行联合培训,进一步便利我们采用从感知性正规化到像学的监管方法,超越了像素智慧损失。 与现代NERF方法的定量和定性比较表明,我们的方法可以极大地提高高分辨率质量,在高分辨率/高分辨率/高分辨率/高分辨率/高频/高分辨率/高频/高分辨率/高分辨率/高分辨率/高分辨率/高分辨率/高分辨率/高分辨率上实现。