When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognition. Experimentally, the work has been tested on a collection of historical marriage records. Results of experiments are presented to show the effect on the performance for different configurations: different ways of encoding the information, doing or not transfer learning and processing at text line or multi-line region level. The results are comparable to state of the art reported in the ICDAR 2017 Information Extraction competition, even though the proposed technique does not use any dictionaries, language modeling or post processing.
翻译:在从手写文件、文本抄录和名称实体识别中提取信息时,通常会作为单独的后续任务而面临。这不利于第一个模块的错误严重影响第二个模块的性能。在这项工作中,我们提议使用单一神经网络和用于纯文本识别的共同结构,共同执行这两项任务。实验性地在收集历史婚姻记录时测试了这项工作。实验结果显示对不同配置的性能的影响:在文本线或多线区域层面将信息编码、进行或不转移学习和处理的不同方式。结果与在ICDAR 2017信息提取竞赛中所报道的艺术水平相似,尽管拟议的技术并不使用任何词典、语言建模或后处理。