Neural Radiance Field (NeRF) has broken new ground in the novel view synthesis due to its simple concept and state-of-the-art quality. However, it suffers from severe performance degradation unless trained with a dense set of images with different camera poses, which hinders its practical applications. Although previous methods addressing this problem achieved promising results, they relied heavily on the additional training resources, which goes against the philosophy of sparse-input novel-view synthesis pursuing the training efficiency. In this work, we propose MixNeRF, an effective training strategy for novel view synthesis from sparse inputs by modeling a ray with a mixture density model. Our MixNeRF estimates the joint distribution of RGB colors along the ray samples by modeling it with mixture of distributions. We also propose a new task of ray depth estimation as a useful training objective, which is highly correlated with 3D scene geometry. Moreover, we remodel the colors with regenerated blending weights based on the estimated ray depth and further improves the robustness for colors and viewpoints. Our MixNeRF outperforms other state-of-the-art methods in various standard benchmarks with superior efficiency of training and inference.
翻译:新的视觉合成(NeRF)因其简单的概念和最先进的质量,在新观点合成中打破了新局面;然而,它由于简单的概念和最先进的质量,在新观点合成中打破了新局面;但是,它由于没有经过使用不同照相机的密集图像的训练,从而阻碍其实际应用,它遭受了严重的性能退化;尽管以前处理该问题的方法取得了可喜的成果,但它们严重依赖额外的培训资源,这违背了追求培训效率的稀疏投入新观点合成理论;在这项工作中,我们提出MixNeRF, 这是一种有效的培训战略,通过以混合密度模型模拟射线,从稀少的输入中进行新观点合成。我们的MixNeRF通过以分布的混合组合模型来估计RGB颜色与射线样品的混合分布,来估计RGB颜色在射线样本中的联合分布。我们还提出一个新的射线深度估计任务,作为有用的培训目标,它与3D场景的几度测量非常相关。此外,我们根据估计的射线深度再生成的混合重量来调整颜色,进一步提高颜色和观点的强度。我们的MixNERF在各种标准标准中比高效率培训中的其他状态方法要优。