风格合成:使用合成数据对历史文件进行语义分割 (Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data)

One of the most pressing problems in the automated analysis of historical documents is the availability of annotated training data. The problem is that labeling samples is a time-consuming task because it requires human expertise and thus, cannot be automated well. In this work, we propose a novel method to construct synthetic labeled datasets for historical documents where no annotations are available. We train a StyleGAN model to synthesize document images that capture the core features of the original documents. While originally, the StyleGAN architecture was not intended to produce labels, it indirectly learns the underlying semantics to generate realistic images. Using our approach, we can extract the semantic information from the intermediate feature maps and use it to generate ground truth labels. To investigate if our synthetic dataset can be used to segment the text in historical documents, we use it to train multiple supervised segmentation models and evaluate their performance. We also train these models on another dataset created by a state-of-the-art synthesis approach to show that the models trained on our dataset achieve better results while requiring even less human annotation effort.

翻译：自动分析历史文件的最紧迫问题是提供附加说明的培训数据。问题在于标签样本是一个耗时的任务,因为它需要人的专门知识,因此不可能实现自动化。在这项工作中,我们提出一种新的方法,在没有说明的情况下,为历史文件构建合成标签数据集;我们培训StyGAN模型,以综合记录原始文件的核心特征的文档图像。虽然StyleGAN结构最初不打算制作标签,但它间接地学习基本语义,以生成现实的图像。我们可以使用我们的方法,从中间地物图中提取语义信息,并利用它生成地面的真相标签。为了调查我们的合成数据集能否用于在历史文件中分割文本,我们用它来培训多个受监督的分解模型并评估其性能。我们还将这些模型放在另一个由最新综合方法创建的数据集上,以显示在我们的数据集上培训的模型取得更好的结果,而不需要人文的注解努力。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日