Font generation is a difficult and time-consuming task, especially in those languages using ideograms that have complicated structures with a large number of characters, such as Chinese. To solve this problem, few-shot font generation and even one-shot font generation have attracted a lot of attention. However, most existing font generation methods may still suffer from (i) large cross-font gap challenge; (ii) subtle cross-font variation problem; and (iii) incorrect generation of complicated characters. In this paper, we propose a novel one-shot font generation method based on a diffusion model, named Diff-Font, which can be stably trained on large datasets. The proposed model aims to generate the entire font library by giving only one sample as the reference. Specifically, a large stroke-wise dataset is constructed, and a stroke-wise diffusion model is proposed to preserve the structure and the completion of each generated character. To our best knowledge, the proposed Diff-Font is the first work that developed diffusion models to handle the font generation task. The well-trained Diff-Font is not only robust to font gap and font variation, but also achieved promising performance on difficult character generation. Compared to previous font generation methods, our model reaches state-of-the-art performance both qualitatively and quantitatively.
翻译:生成字体是一项困难而费时的任务, 特别是在使用具有大量字符( 如中文) 的复杂结构的美方图语言中。 为了解决这个问题, 少发的字体生成甚至一发的字体生成吸引了很多注意力。 然而, 大多数现有的字体生成方法可能仍然受到以下因素的影响:(一) 巨大的交叉组合差异挑战;(二) 微妙的交叉组合变异问题;(三) 复杂字符的生成不正确。 在本文件中, 我们提议了一种新型的单发字体生成方法, 其基础是传播模型, 名为 Diff- Font, 可以在大型数据集上进行精密的培训。 拟议的模型的目的是通过仅提供一个样本作为参考来生成整个字体库。 具体地说, 正在构建一个大中转数据集, 并提出了一个中转的扩展模型, 以维护每个生成的字符的结构和完成。 根据我们的最佳知识, 拟议的 Diff- Font 是开发用于处理字体生成任务的传播模型的首项工作。 训练有素的Diff- Font 不仅能够建立字体差距和字体生成的状态, 也能够实现前一质生成。