Uncertainty Guided Policy for Active Robotic 3D Reconstruction using Neural Radiance Fields

In this paper, we tackle the problem of active robotic 3D reconstruction of an object. In particular, we study how a mobile robot with an arm-held camera can select a favorable number of views to recover an object's 3D shape efficiently. In contrast to existing solutions to this problem, we leverage the popular neural radiance fields-based object representation, which has recently shown impressive results on various computer vision tasks. However, it is not straightforward to reason directly about an object's explicit 3D geometric details using such a representation, which makes next-best-view selection for dense 3D reconstruction challenging. This paper introduces a ray-based volumetric uncertainty estimator, which computes the entropy of the weight distribution of the color samples along each ray of the object's implicit neural representation. We show that, with the proposed estimator, it is possible to infer the uncertainty of the underlying 3D geometry given a novel view. We then present a next-best-view selection policy guided by the ray-based volumetric uncertainty in neural radiance fields-based representations. Encouraging experimental results on synthetic and real-world data suggest that the approach can open a new research direction: using an implicit 3D object representation for the next-best-view problem in robot vision applications. This distinguishes our approach from existing approaches that rely on explicit 3D geometric modeling.
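The ray-based estimator described in the abstract can be illustrated with a minimal NumPy sketch. It assumes the standard NeRF volume-rendering weights and normalizes them along each ray before taking the Shannon entropy; the abstract does not give the exact formulation, so the normalization, the averaging over rays, and the function names `ray_weights`, `ray_entropy`, and `next_best_view` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def ray_weights(sigma, delta):
    """Standard NeRF rendering weights, w_i = T_i * alpha_i.

    sigma: densities, shape (num_rays, num_samples)
    delta: distances between adjacent samples, same shape
    """
    alpha = 1.0 - np.exp(-sigma * delta)
    # Transmittance T_i = prod_{j < i} (1 - alpha_j), with T_0 = 1.
    trans = np.cumprod(1.0 - alpha, axis=-1)
    trans = np.concatenate(
        [np.ones_like(trans[..., :1]), trans[..., :-1]], axis=-1
    )
    return trans * alpha


def ray_entropy(weights, eps=1e-10):
    """Shannon entropy of the per-ray weight distribution.

    Assumption: weights are normalized to sum to 1 along each ray.
    A ray with all-zero weights gets entropy 0 under this choice.
    """
    p = weights / (weights.sum(axis=-1, keepdims=True) + eps)
    return -(p * np.log(p + eps)).sum(axis=-1)


def next_best_view(candidate_weights):
    """Pick the candidate view with the highest mean ray entropy.

    candidate_weights: dict mapping a (hypothetical) view id to the
    weights rendered from that view, shape (num_rays, num_samples).
    """
    scores = {
        view: float(ray_entropy(w).mean())
        for view, w in candidate_weights.items()
    }
    best = max(scores, key=scores.get)
    return best, scores
```

A ray whose weight mass sits on a single sample (a confidently located surface) scores near zero, while a ray whose weights are spread over many samples scores close to the logarithm of the sample count, so a greedy policy of this kind would steer the camera toward views where the geometry is least settled.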


Related content

3D reconstruction


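In computer vision, 3D reconstruction is the process of recovering three-dimensional information from single-view or multi-view images. Because a single view carries incomplete information, single-view reconstruction has to rely on prior knowledge. Multi-view 3D reconstruction (analogous to human binocular localization) is comparatively easier: the camera is first calibrated, that is, the relationship between the camera's image coordinate system and the world coordinate system is computed, and the 3D information is then reconstructed from the information in multiple 2D images. 3D object reconstruction is a shared scientific problem and core technology in computer-aided geometric design (CAGD), computer graphics (CG), computer animation, computer vision, medical image processing, scientific computing, virtual reality, digital media creation, and related fields. There are two main ways to produce a 3D representation of an object in a computer. The first uses geometric modeling software to build a 3D geometric model interactively under human control; the second acquires the geometry of a real object by some measurement technique. The first is technically mature and supported by existing software such as 3DMAX, Maya, AutoCAD, and UG, which generally represent geometry with curves and surfaces that have mathematical expressions. The second is generally called 3D reconstruction: the mathematical process and computer technology of recovering an object's 3D information (shape and so on) from 2D projections, including data acquisition, preprocessing, point cloud registration, and feature analysis.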
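The two-step multi-view recipe (calibrate, then reconstruct from several 2D images) can be sketched for a single point with linear triangulation (the direct linear transform). This is only one common way to do the second step, chosen here for illustration; the 3x4 projection matrices are assumed to be already known from calibration, and the names `project` and `triangulate` are hypothetical.

```python
import numpy as np


def project(P, X):
    """Project a 3D world point X (length 3) with a 3x4 camera matrix P."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]


def triangulate(projections, pixels):
    """Recover one 3D point from its pixel coordinates in two or more views.

    projections: list of 3x4 camera matrices (image <-> world relationship)
    pixels: list of (u, v) observations, one per camera
    """
    rows = []
    for P, (u, v) in zip(projections, pixels):
        # Each view contributes two linear constraints on the homogeneous point.
        rows.append(u * P[2] - P[0])
        rows.append(v * P[2] - P[1])
    A = np.stack(rows)
    # The solution is the right singular vector of the smallest singular value.
    _, _, vt = np.linalg.svd(A)
    X_h = vt[-1]
    return X_h[:3] / X_h[3]


# Two hypothetical calibrated cameras sharing intrinsics K, one unit apart.
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = K @ np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])
```

With noise-free observations the point is recovered up to floating-point error; with real images the same linear system is usually solved in a least-squares sense and then refined.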
