In this paper, we propose a span based model combined with syntactic information for n-ary open information extraction. The advantage of span model is that it can leverage span level features, which is difficult in token based BIO tagging methods. We also improve the previous bootstrap method to construct training corpus. Experiments show that our model outperforms previous open information extraction systems. Our code and data are publicly available at https://github.com/zhanjunlang/Span_OIE
翻译:在本文中,我们提出了一个基于跨线的模型,结合N-ary公开信息提取的合成信息。跨线模型的优势在于它能够利用跨线的特性,在象征性的BIO标记方法上,这是困难的。我们还改进了以前用来构建训练材料的“陷阱”方法。实验表明,我们的模型比以前的开放信息提取系统要好。我们的代码和数据可在https://github.com/zhanjunlang/Span_OIE公开查阅。