Recently neural scene representations have provided very impressive results for representing 3D scenes visually, however, their study and progress have mainly been limited to visualization of virtual models in computer graphics or scene reconstruction in computer vision without explicitly accounting for sensor and pose uncertainty. Using this novel scene representation in robotics applications, however, would require accounting for this uncertainty in the neural map. The aim of this paper is therefore to propose a novel method for training {\em probabilistic neural scene representations} with uncertain training data that could enable the inclusion of these representations in robotics applications. Acquiring images using cameras or depth sensors contains inherent uncertainty, and furthermore, the camera poses used for learning a 3D model are also imperfect. If these measurements are used for training without accounting for their uncertainty, then the resulting models are non-optimal, and the resulting scene representations are likely to contain artifacts such as blur and un-even geometry. In this work, the problem of uncertainty integration to the learning process is investigated by focusing on training with uncertain information in a probabilistic manner. The proposed method involves explicitly augmenting the training likelihood with an uncertainty term such that the learnt probability distribution of the network is minimized with respect to the training uncertainty. It will be shown that this leads to more accurate image rendering quality, in addition to more precise and consistent geometry. Validation has been carried out on both synthetic and real datasets showing that the proposed approach outperforms state-of-the-art methods. The results show notably that the proposed method is capable of rendering novel high-quality views even when the training data is limited.
翻译:最近神经场景展示为直观地代表3D场景提供了非常令人印象深刻的结果,然而,其研究和进展主要限于计算机图形虚拟模型的视觉化或计算机视觉场景重建的虚拟模型的视觉化,而没有明确计算传感器和造成不确定性。然而,利用机器人应用中的这种新颖场景展示,将需要对神经图中的不确定性进行核算。因此,本文件的目的是提出一种新的方法,用于培训具有概率性神经场景展示 。在这项工作中,不确定性与学习过程的整合问题通过以真实性方式侧重于具有不确定性的信息来研究。使用相机或深度传感器获取图像时含有内在的不确定性,此外,用于学习3D模型的相机也存在不完善之处。如果这些测量用于培训而没有明确考虑传感器的不确定性,那么由此产生的模型将是非最佳的,因此产生的场景展示可能会包含模糊和不精确的地质图象。在以更稳定的方式提供培训时,所提出的不确定性与学习过程的整合问题将会得到研究。 拟议的方法涉及明确增加培训可能性的可能性,甚至包括用于学习3D型模型的精确度分析,从而能够更准确地显示准确的网络的概率分布。