Reconstructing high-fidelity 3D objects from sparse, partial observation is of crucial importance for various applications in computer vision, robotics, and graphics. While recent neural implicit modeling methods show promising results on synthetic or dense datasets, they perform poorly on real-world data that is sparse and noisy. This paper analyzes the root cause of such deficient performance of a popular neural implicit model. We discover that the limitations are due to highly complicated objectives, lack of regularization, and poor initialization. To overcome these issues, we introduce two simple yet effective modifications: (i) a deep encoder that provides a better and more stable initialization for latent code optimization; and (ii) a deep discriminator that serves as a prior model to boost the fidelity of the shape. We evaluate our approach on two real-wold self-driving datasets and show superior performance over state-of-the-art 3D object reconstruction methods.
翻译:部分观测从稀少的地方重建高不洁的 3D 对象,对于计算机视觉、机器人和图形的各种应用至关重要。虽然最近的神经隐含模型方法在合成数据集或密集数据集方面显示出有希望的结果,但它们在稀少和吵闹的现实世界数据方面表现不佳。本文分析了流行的隐含神经模型表现不良的根源。我们发现这些限制是由于高度复杂的目标、缺乏正规化和初始化不力造成的。为了克服这些问题,我们引入了两种简单而有效的修改:(一) 一种深层编码,为潜在代码优化提供更好和更加稳定的初始化;(二) 一种深层区分器,作为促进形状忠实的前一个模型。我们评估了我们关于两个真实的自我驱动数据集的方法,并展示了比最先进的3D对象重建方法的先进性。