Scalable generation of furniture layouts is essential for many applications in virtual reality, augmented reality, game development and synthetic data generation. Many existing methods tackle this problem as a sequence generation problem which imposes a specific ordering on the elements of the layout making such methods impractical for interactive editing or scene completion. Additionally, most methods focus on generating layouts unconditionally and offer minimal control over the generated layouts. We propose COFS, an architecture based on standard transformer architecture blocks from language modeling. The proposed model is invariant to object order by design, removing the unnatural requirement of specifying an object generation order. Furthermore, the model allows for user interaction at multiple levels enabling fine grained control over the generation process. Our model consistently outperforms other methods which we verify by performing quantitative evaluations. Our method is also faster to train and sample from, compared to existing methods.
翻译:在虚拟现实中,可缩放的家具布局对于许多应用、扩大现实、游戏开发和合成数据生成至关重要。许多现有方法将这一问题作为一个序列生成问题加以解决,对布局各要素进行具体排序,使这些布局各要素不切实际,无法进行互动编辑或场景完成。此外,大多数方法侧重于生成布局,对生成的布局提供最小的控制。我们建议以标准变压器结构块为基础,不使用语言建模的建筑COFS。拟议模式无法通过设计来按部就班,消除指定物体生成顺序的不正常要求。此外,该模式允许用户在多个级别进行互动,以便对生成过程进行精细的谷底控制。我们的模型一贯地优于其他方法,我们通过进行定量评估来核实。我们的方法也比现有方法更快地培训和抽样。