Most previous studies of document-level event extraction mainly focus on building argument chains in an autoregressive way, which achieves a certain success but is inefficient in both training and inference. In contrast to the previous studies, we propose a fast and lightweight model named as PTPCG. In our model, we design a novel strategy for event argument combination together with a non-autoregressive decoding algorithm via pruned complete graphs, which are constructed under the guidance of the automatically selected pseudo triggers. Compared to the previous systems, our system achieves competitive results with 19.8\% of parameters and much lower resource consumption, taking only 3.8\% GPU hours for training and up to 8.5 times faster for inference. Besides, our model shows superior compatibility for the datasets with (or without) triggers and the pseudo triggers can be the supplements for annotated triggers to make further improvements. Codes are available at https://github.com/Spico197/DocEE .
翻译:以往关于文件级事件提取的大多数研究主要侧重于以自动递减的方式建立参数链,这取得了一定的成功,但在培训和推论方面都效率低下。与以往的研究相比,我们提出了一个称为PTPCG的快速和轻量级模型。在我们的模式中,我们设计了一个新的事件论证战略,同时,通过通过通过编造完整的图绘制的非递减解码算法,这些算法是在自动选择的假触发器的指导下构建的。与以往的系统相比,我们的系统取得了竞争性结果,参数为19.8 ⁇,资源消耗低得多,培训只用了3.8 ⁇ GPU小时,推论速度更快达8.5倍。此外,我们的模型显示数据集与(或没有)触发器的高度兼容性,假触发器可以是附加触发器的补充,以进一步改进。代码可在https://github.com/spio197/DocEEE查阅。