We consider open domain event extraction, the task of extracting unconstraint types of events from news clusters. A novel latent variable neural model is constructed, which is scalable to very large corpus. A dataset is collected and manually annotated, with task-specific evaluation metrics being designed. Results show that the proposed unsupervised model gives better performance compared to the state-of-the-art method for event schema induction.
翻译:我们考虑开放域事件提取,这是从新闻群集中提取不受约束事件的任务。 构建了一个新的潜伏可变神经模型,该模型可以伸缩到非常大的范围。 收集了一个数据集并手工加注,并设计了具体任务评价指标。 结果显示,与最先进的事件计划诱导方法相比,拟议中的无监督模型的性能更好。