FUNSD is one of the limited publicly available datasets for information extraction from document im-ages. The information in the FUNSD dataset is defined by text areas of four categories ("key", "value", "header", "other", and "background") and connectivity between areas as key-value relations. In-specting FUNSD, we found several inconsistency in labeling, which impeded its applicability to thekey-value extraction problem. In this report, we described some labeling issues in FUNSD and therevision we made to the dataset. We also reported our implementation of for key-value detection onFUNSD using a UNet model as baseline results and an improved UNet model with Channel-InvariantDeformable Convolution.
翻译:FUNSD是用于从文件末位提取信息的有限公开数据集之一。FUNSD数据集中的信息由四类(“钥匙”、“价值”、“价值”、“标题”、“其他”、“背景”)的文本领域和作为关键价值关系的区域之间的连通性来定义。我们查看FUNSD时发现在标签方面存在一些不一致之处,这妨碍了其适用于关键价值提取问题。我们在本报告中描述了FUNSD中的一些标签问题,并预见到我们为数据集做了标签。我们还报告了我们使用UNet模型作为基线结果和经改进的“UNet”模型与频道-变量变异变异变异连接在一起对FUNSD进行关键价值检测的情况。