Modelling individual objects as Neural Radiance Fields (NeRFs) within a robotic context can benefit many downstream tasks such as scene understanding and object manipulation. However, real-world training data collected by a robot deviate from the ideal in several key aspects. (i) The trajectories are constrained and full visual coverage is not guaranteed - especially when obstructions are present. (ii) The poses associated with the images are noisy. (iii) The objects are not easily isolated from the background. This paper addresses the above three points and uses the outputs of an object-based SLAM system to bound objects in the scene with coarse primitives and - in concert with instance masks - identify obstructions in the training images. Objects are therefore automatically bounded, and non-relevant geometry is excluded from the NeRF representation. The method's performance is benchmarked under ideal conditions and tested against errors in the poses and instance masks. Our results show that object-based NeRFs are robust to pose variations but sensitive to the quality of the instance masks.
翻译:在机器人环境下将单个物体建模为神经辐射场(神经辐射场)可有益于许多下游任务,如现场理解和物体操纵等,然而,机器人收集的真实世界培训数据在若干关键方面偏离理想。 (一) 轨道受到限制,无法保证全部视觉覆盖,特别是在存在障碍的情况下。 (二) 与图像相关的构成很吵。 (三) 物体不容易与背景隔开。本文件述及上述三个点,并使用基于物体的SLAM系统的产出将物体与粗糙的原始生物捆绑在现场,同时与实例面具协调,查明培训图像中的阻碍因素。因此,物体自动被捆绑,非相关的几何方法被排除在NERF代表之外。该方法的性能在理想条件下进行基准测试,并对照表面和外壳面具中的错误进行测试。我们的结果显示,基于物体的内射管能够产生变化,但对实例面具的质量敏感。