DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality (DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality)

Vector font synthesis is a challenging and ongoing problem in the fields of Computer Vision and Computer Graphics. The recently-proposed DeepVecFont achieved state-of-the-art performance by exploiting information of both the image and sequence modalities of vector fonts. However, it has limited capability for handling long sequence data and heavily relies on an image-guided outline refinement post-processing. Thus, vector glyphs synthesized by DeepVecFont still often contain some distortions and artifacts and cannot rival human-designed results. To address the above problems, this paper proposes an enhanced version of DeepVecFont mainly by making the following three novel technical contributions. First, we adopt Transformers instead of RNNs to process sequential data and design a relaxation representation for vector outlines, markedly improving the model's capability and stability of synthesizing long and complex outlines. Second, we propose to sample auxiliary points in addition to control points to precisely align the generated and target B\'ezier curves or lines. Finally, to alleviate error accumulation in the sequential generation process, we develop a context-based self-refinement module based on another Transformer-based decoder to remove artifacts in the initially synthesized glyphs. Both qualitative and quantitative results demonstrate that the proposed method effectively resolves those intrinsic problems of the original DeepVecFont and outperforms existing approaches in generating English and Chinese vector fonts with complicated structures and diverse styles.

翻译：翻译标题：DeepVecFont-v2：利用Transformers合成更高质量的矢量字体翻译摘要：矢量字体合成是计算机视觉和计算机图形领域中具有挑战性的问题。最近提出的DeepVecFont通过利用矢量字体图像和序列模态的信息实现了最先进的性能。然而，它在处理长序列数据方面的能力有限，并且严重依赖于一个基于图像引导的轮廓精化后处理方法。因此，DeepVecFont合成的矢量字形仍然经常包含一些扭曲和伪影，无法与人类设计的结果相媲美。为了解决以上问题，本文提出了DeepVecFont的增强版本，主要通过以下三个创新技术贡献来实现。首先，我们采用Transformers代替RNN来处理序列数据，并设计了一种松弛表示方法来标记矢量轮廓，显著提高了模型合成长和复杂轮廓的能力和稳定性。其次，我们提出采样辅助点来与控制点精确对齐生成和目标Bézier曲线或线段。最后，为了减轻顺序生成过程中的误差积累，我们开发了一个基于上下文的自我精化模块，基于另一个基于Transformer的解码器，可以消除一开始合成字形中的伪影。定量和定性结果都表明，所提出的方法有效地解决了原始DeepVecFont的内在问题，并在生成具有复杂结构和多样化风格的英文和中文矢量字体方面优于现有方法。