We present a generative model for complex free-form structures such as stroke-based drawing tasks. While previous approaches rely on sequence-based models for drawings of basic objects or handwritten text, we propose a model that treats drawings as a collection of strokes that can be composed into complex structures such as diagrams (e.g., flow-charts). At the core of the approach lies a novel autoencoder that projects variable-length strokes into a latent space of fixed dimension. This representation space allows a relational model, operating in latent space, to better capture the relationship between strokes and to predict subsequent strokes. We demonstrate qualitatively and quantitatively that our proposed approach is able to model the appearance of individual strokes, as well as the compositional structure of larger diagram drawings. Our approach is suitable for interactive use cases such as auto-completing diagrams. We make code and models publicly available at https://eth-ait.github.io/cose.
翻译:我们提出了一个复杂的自由形式结构的基因模型,例如中风绘图任务。虽然以前的方法依赖于基于基本对象或手写文字绘图的顺序模型,但我们提出了一个模型,将绘图作为可组成图表(如流程图)等复杂结构的划线集处理。这个方法的核心是一个新的自动编码器,将可变长的划线投入固定维度的潜在空间。这个代表空间允许一种在潜空运行的关系模型,以更好地捕捉划线之间的关系并预测随后的划线。我们从质量和数量上表明,我们提议的方法能够模拟单个划线的外观以及大图绘制的构成结构。我们的方法适合于诸如自动完成图表等交互式使用案例。我们在https://eth-ait.github.io/cose上公开提供代码和模型。