Event detection (ED), a sub-task of event extraction, involves identifying triggers and categorizing event mentions. Existing methods primarily rely upon supervised learning and require large-scale labeled event datasets which are unfortunately not readily available in many real-life applications. In this paper, we consider and reformulate the ED task with limited labeled data as a Few-Shot Learning problem. We propose a Dynamic-Memory-Based Prototypical Network (DMB-PN), which exploits Dynamic Memory Network (DMN) to not only learn better prototypes for event types, but also produce more robust sentence encodings for event mentions. Differing from vanilla prototypical networks simply computing event prototypes by averaging, which only consume event mentions once, our model is more robust and is capable of distilling contextual information from event mentions for multiple times due to the multi-hop mechanism of DMNs. The experiments show that DMB-PN not only deals with sample scarcity better than a series of baseline models but also performs more robustly when the variety of event types is relatively large and the instance quantity is extremely small.
翻译:事件探测(ED)是事件提取的子任务,它涉及确定触发因素和分类事件。现有方法主要依靠监督的学习,需要大规模标记的事件数据集,但不幸的是,在许多现实应用中,这些数据集并不容易获得。在本文中,我们考虑和重新拟订ED任务,将有限的标记数据作为少于零的学习问题。我们建议建立一个动态-计量-基于动态记忆网络(DMB-PN),它不仅利用动态记忆网络(DMN)来学习事件类型的更好的原型,而且还产生更强的句子编码。从香草原型网络到按平均计算事件原型(仅使用事件一次),我们的模型更加健全,并且能够从多机机制下多次提到的事件中提取背景信息。实验表明,DMB-PN不仅处理抽样短缺问题比一系列基线模型要好,而且在事件种类比较大、实例数量极小的情况下,还进行更强有力的演练。