Generating font glyphs of consistent style from one or a few reference glyphs, i.e., font completion, is an important task in topographical design. As the problem is more well-defined than general image style transfer tasks, thus it has received interest from both vision and machine learning communities. Existing approaches address this problem as a direct image-to-image translation task. In this work, we innovate to explore the generation of font glyphs as 2D graphic objects with the graph as an intermediate representation, so that more intrinsic graphic properties of font styles can be captured. Specifically, we formulate a cross-modality cycled image-to-image model structure with a graph constructor between an image encoder and an image renderer. The novel graph constructor maps a glyph's latent code to its graph representation that matches expert knowledge, which is trained to help the translation task. Our model generates improved results than both image-to-image baseline and previous state-of-the-art methods for glyph completion. Furthermore, the graph representation output by our model also provides an intuitive interface for users to do local editing and manipulation. Our proposed cross-modality cycled representation learning has the potential to be applied to other domains with prior knowledge from different data modalities. Our code is available at https://github.com/VITA-Group/Font_Completion_Graph.
翻译:从一个或几个参考字形中生成一致风格的字体字形,即完成字型,这是地形设计中的一项重要任务。由于问题比一般图像样式传输任务定义得更明确,因此它受到视觉和机器学习界的兴趣。现有的方法将这一问题作为直接图像到图像翻译的任务来处理。在这项工作中,我们创新,探索以图解作为中间表示的2D图形对象生成字体字形,以便能够捕捉到字体样式的更内在的图形属性。具体地说,我们设计了一个跨模式循环图像到图像样式转换模型结构,在图像编码器和图像转换器之间有一个图形构建器。新颖的图形构建器绘制了一个与图形表达法的潜在代码,该代码与专家知识相匹配,并经过培训帮助翻译任务。我们的模型产生更好的结果,比图象到图像映射基线和以往的状态/艺术完成方法都更好。此外,我们模型的图像循环图像循环图像图像图像图像到图像转换模型的图像图像图像模型模型模型模型模型模型模型模型模型模型模型模型也为图像结构中的图形构造界面,供我们的潜在数据循环,用于进行本地编辑。