SLOGAN: 任意语言和校外文字手写风格合成 (SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text)

Large amounts of labeled data are urgently required for the training of robust text recognizers. However, collecting handwriting data of diverse styles, along with an immense lexicon, is considerably expensive. Although data synthesis is a promising way to relieve data hunger, two key issues of handwriting synthesis, namely, style representation and content embedding, remain unsolved. To this end, we propose a novel method that can synthesize parameterized and controllable handwriting Styles for arbitrary-Length and Out-of-vocabulary text based on a Generative Adversarial Network (GAN), termed SLOGAN. Specifically, we propose a style bank to parameterize the specific handwriting styles as latent vectors, which are input to a generator as style priors to achieve the corresponding handwritten styles. The training of the style bank requires only the writer identification of the source images, rather than attribute annotations. Moreover, we embed the text content by providing an easily obtainable printed style image, so that the diversity of the content can be flexibly achieved by changing the input printed image. Finally, the generator is guided by dual discriminators to handle both the handwriting characteristics that appear as separated characters and in a series of cursive joins. Our method can synthesize words that are not included in the training vocabulary and with various new styles. Extensive experiments have shown that high-quality text images with great style diversity and rich vocabulary can be synthesized using our method, thereby enhancing the robustness of the recognizer.

翻译：培训强大的文本辨识器迫切需要大量标签数据。然而, 收集不同风格的笔迹数据, 以及庞大的词汇, 费用相当昂贵。尽管数据合成是缓解数据饥饿的一个很有希望的方法, 但笔迹合成的两个关键问题, 即风格表达和内容嵌入, 仍未解决。为此, 我们提出了一个新颖的方法, 可以将任意的Length 和外版格式的参数化和控制性笔迹样式结合起来, 以创制式反versarial 网络( GAN) 为基础, 叫做 SLOGAN 。具体地说, 我们建议建立一个风格银行, 将特定的笔迹样式作为潜在的矢量进行参数化, 将它作为生成者实现相应的手写样式的输入。对风格库的培训仅需要作者识别源图像, 而不是属性说明。此外, 我们通过提供易于获取的印刷风格图像( GAN), 使内容的多样化能够通过修改印刷图像来灵活实现。最后, 我们的双重区分器指导着将特定的笔迹样式作为潜在的矢量,, 并且将我们的笔迹质化方法中显示为高层次。。将的的的将将的格式化的格式的和的的将的的格式化和的的的的的的的将的的的的和的的格式化的的的的的的的的的的的的和的的的的和的的的的的的的和的的的的的的的的的的的的的的的的的的的的的的的和的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的

相关内容