Data scarcity and imbalance have been the main factors that hinder the progress of event extraction (EE). In this work, we propose a self-training with gradient guidance (STGG) framework which consists of (1) a base event extraction model which is firstly trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions, and (2) a scoring model that takes in each predicted event trigger and argument as well as their path in the Abstract Meaning Representation (AMR) graph to estimate a probability score indicating the correctness of the event prediction. The new event predictions along with their correctness scores are then used as pseudo labeled examples to improve the base event extraction model while the magnitude and direction of its gradients are guided by the correctness scores. Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+ and ERE-EN, demonstrate the effectiveness of the STGG framework on event extraction task with up to 1.9 F-score improvement over the base event extraction models. Our experimental analysis further shows that STGG is a general framework as it can be applied to any base event extraction models and improve their performance by leveraging broad unlabeled data, even when the high-quality AMR graph annotations are not available.
翻译:数据稀缺和不平衡是阻碍事件提取进展的主要因素。在这项工作中,我们提议了一个带有梯度指导(STGG)框架的自我培训,该框架包括:(1) 基础事件提取模型,首先根据现有事件说明进行培训,然后适用于大型无标签公司,以预测新事件提到的情况;(2) 评分模型,在每个预测事件触发和论证及其路径中采用评分模型,以估计概率分数,表明事件预测的正确性能。然后,新事件的预测及其正确性分数被用作假标签示例,以改进基础事件提取模型,而其梯度的规模和方向则以正确性分数为指导。 三个基准数据集的实验结果,包括ACE05-E、ACE05-E+和ERE-EN,显示STGG框架在事件提取任务上的有效性,比基本事件提取模型的准确性提高了1.9法郎。 我们的实验分析还显示,新事件预测及其正确性分数是一个总框架,可以适用于任何基准事件提取模型,而其梯度和方向则以正确性计计值计。