少数片点文件级活动 (Few-Shot Document-Level Event Argument Extraction)

Event argument extraction (EAE) has been well studied at the sentence level but under-explored at the document level. In this paper, we study to capture event arguments that actually spread across sentences in documents. Prior works mainly assume full access to rich document supervision, ignoring the fact that the argument supervision is limited in documents. To fill this gap, we present FewDocAE, a Few-Shot Document-Level Event Argument Extraction benchmark, based on the largest document-level event extraction dataset DocEE. We first define the new problem and reconstruct the corpus by a novel N-Way-D-Doc sampling instead of the traditional N-Way-K-Shot strategy. Then we adjust the advanced document-level neural models into the few-shot setting to provide baseline results under in- and cross-domain settings. Since the argument extraction depends on the context from multiple sentences and the learning process is limited to very few examples, we find the task to be very challenging with substantively low performance. Considering FewDocAE is closely related to practical use under low-resource regimes, we hope this benchmark encourages more research in this direction. Our data and codes will be available online.

翻译：在判决一级对事件提取参数(EAE)进行了很好的研究,但在文件一级探索不足。在本文件中,我们研究的是捕捉实际散布在文档中各句子的事件参数。先前的工作主要假设充分接触丰富的文件监督, 忽略了对参数监督的限制这一事实。为了填补这一空白, 我们介绍了根据最大的文件级事件提取数据集( DocE), 少许点文件级文件级事件提取参数( EAE) 基准。我们首先定义了新问题, 并用新颖的N- Way- D- Doc 取样法而不是传统的N- Way- K- Shot 战略来重新构建文件库。然后我们将高级文件级神经模型调整到少数点设置中, 在多处设置下提供基线结果。由于参数提取取决于多个句子的背景, 学习过程仅限于极少数例子, 我们发现任务非常艰巨, 表现非常低。考虑到 FewDoCAE 与低资源制度下的实际使用密切相关, 我们希望这一基准能鼓励更多在线研究。

相关内容

小样本学习

关注 215

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日