This work introduces a new task of instance-incremental scene graph generation: Given an empty room of the point cloud, representing it as a graph and automatically increasing novel instances. A graph denoting the object layout of the scene is finally generated. It is an important task since it helps to guide the insertion of novel 3D objects into a real-world scene in vision-based applications like augmented reality. It is also challenging because the complexity of the real-world point cloud brings difficulties in learning object layout experiences from the observation data (non-empty rooms with labeled semantics). We model this task as a conditional generation problem and propose a 3D autoregressive framework based on normalizing flows (3D-ANF) to address it. We first represent the point cloud as a graph by extracting the containing label semantics and contextual relationships. Next, a model based on normalizing flows is introduced to map the conditional generation of graphic elements into the Gaussian process. The mapping is invertible. Thus, the real-world experiences represented in the observation data can be modeled in the training phase, and novel instances can be sequentially generated based on the Gaussian process in the testing phase. We implement this new task on the dataset of 3D point-based scenes (3DSSG and 3RScan) and evaluate the performance of our method. Experiments show that our method generates reliable novel graphs from the real-world point cloud and achieves state-of-the-art performance on the benchmark dataset.
翻译:这项工作引入了一种新任务, 包括: 在点云空空空空的房间里, 以图解形式代表它, 并自动增加新的实例 。 最终生成了一个图表, 显示场景的物体布局 。 这是一项重要的任务, 因为它有助于引导将新的 3D 对象插入一个真实世界的场景, 像增强现实一样以视觉为基础的应用程序 。 由于真实世界点云的复杂, 学习观察数据( 带有标签语义的非空房间) 的天体布局经验非常困难 。 因此, 将此项任务建为有条件的生成问题, 并提议一个基于正常化流程( 3D- ATF) 的 3D 自动递增框架来解决这个问题 。 我们首先通过提取含有标签语义和背景关系的图像, 将点云作为图表的图解作为图表 。 下一个基于正常化流的模型, 将有条件生成的图形元素纳入高司进程 。 绘图是不可忽略的 。 因此, 观察数据中所代表的真实世界经验可以建模在培训阶段建模,, 并且 将新的 以 3D 度 测试我们 的 的 的 的 将运行方法 用于 的 。