Relation extraction is a central task in natural language processing (NLP) and information retrieval (IR) research. We argue that an important type of relation not explored in NLP or IR research to date is that of an event being an argument - required or optional - of another event. We introduce the human-annotated Event Dependency Relation dataset (EDeR) which provides this dependency relation. The annotation is done on a sample of documents from the OntoNotes dataset, which has the added benefit that it integrates with existing, orthogonal, annotations of this dataset. We investigate baseline approaches for predicting the event dependency relation, the best of which achieves an accuracy of 82.61 for binary argument/non-argument classification. We show that recognizing this relation leads to more accurate event extraction (semantic role labelling) and can improve downstream tasks that depend on this, such as co-reference resolution. Furthermore, we demonstrate that predicting the three-way classification into the required argument, optional argument or non-argument is a more challenging task.
翻译:关系抽取是自然语言处理(NLP)和信息检索(IR)研究中的核心任务。我们认为,NLP或IR研究中尚未探索的一个重要类型的关系是事件作为另一个事件的参数(必需的或可选的参数)。我们介绍了人工注释的事件依赖关系数据集(EDeR),该数据集提供这种依赖关系。注释在OntoNotes数据集的文档样本上进行,这具有与现有的正交注释集成的附加优势。我们研究了预测事件依赖关系的基线方法,其中最佳方法的二分类参数/非参数分类准确率达到82.61。我们表明,识别这种关系可以导致更准确的事件提取(语义角色标记)并可以改善取决于此的下游任务,例如共指消解。此外,我们证明,预测三分分类为必需参数、可选参数或非参数是一个更具挑战性的任务。