A neural radiance field (NeRF) is a scene model supporting high-quality view synthesis, optimized per scene. In this paper, we explore enabling user editing of a category-level NeRF - also known as a conditional radiance field - trained on a shape category. Specifically, we introduce a method for propagating coarse 2D user scribbles to the 3D space, to modify the color or shape of a local region. First, we propose a conditional radiance field that incorporates new modular network components, including a shape branch that is shared across object instances. Observing multiple instances of the same category, our model learns underlying part semantics without any supervision, thereby allowing the propagation of coarse 2D user scribbles to the entire 3D region (e.g., chair seat). Next, we propose a hybrid network update strategy that targets specific network components, which balances efficiency and accuracy. During user interaction, we formulate an optimization problem that both satisfies the user's constraints and preserves the original object structure. We demonstrate our approach on various editing tasks over three shape datasets and show that it outperforms prior neural editing approaches. Finally, we edit the appearance and shape of a real photograph and show that the edit propagates to extrapolated novel views.
翻译:神经光亮场( NERF) 是一个支持高质量视图合成的场景模型, 每个场景都优化了。 在本文中, 我们探索一个功能化的用户编辑工具, 在形状类别上培训了一个类别级的 NERF - 也称为有条件的光亮场 。 具体地说, 我们引入了一种向 3D 空间 传播粗2D 用户刻字的方法, 以修改本地区域的颜色或形状 。 首先, 我们提出一个有条件的光亮场, 包含新的模块网络组件, 包括一个在天体中共享的形状分支 。 观察同一类别的多个实例, 我们的模型在没有任何监督的情况下学习了内部部分的语义, 从而允许向整个 3D 区域( 如主席座位) 传播粗2D 用户刻字。 下一步, 我们提出一个混合网络更新策略, 瞄准特定的网络组件, 以平衡效率和准确性。 在用户互动中, 我们提出一个最优化的问题, 既满足用户的制约, 也保存原始对象结构 。 我们展示了我们关于三个形状数据集的各种编辑任务的方法, 并显示它超越了之前的图像的外观。 最后, 我们修改了真实的图像的外观。