Syntax has been shown to benefit Coreference Resolution from incorporating long-range dependencies and structured information captured by syntax trees, either in traditional statistical machine learning based systems or recently proposed neural models. However, most leading systems use only dependency trees. We argue that constituent trees also encode important information, such as explicit span-boundary signals captured by nested multi-word phrases, extra linguistic labels and hierarchical structures useful for detecting anaphora. In this work, we propose a simple yet effective graph-based method to incorporate constituent syntactic structures. Moreover, we also explore to utilise higher-order neighbourhood information to encode rich structures in constituent trees. A novel message propagation mechanism is therefore proposed to enable information flow among elements in syntax trees. Experiments on the English and Chinese portions of OntoNotes 5.0 benchmark show that our proposed model either beats a strong baseline or achieves new state-of-the-art performance. (Code is available at https://github.com/Fantabulous-J/Coref-Constituent-Graph)
翻译:语法被证明有利于Coferation Result, 将长距离依赖性和由语法树收集的结构性信息纳入传统统计机学习系统或最近提出的神经模型,但是,大多数主要系统只使用依赖性树。我们争辩说,成份树也编码重要信息,例如由巢状多字词组、额外语言标签和等级结构所捕捉的清晰的跨边界信号,这些信号可用于探测肛门。在这项工作中,我们提议一种简单而有效的图形化方法,以纳入成份合成结构。此外,我们还探索利用更高层次的街道信息,对成份树中的丰富结构进行编码。因此,提议了一个新的信息传播机制,以便能够在语法树中的元素之间进行信息流动。对Onto Notes 5.0基准的英文和中文部分进行的实验表明,我们提议的模型要么打破了强大的基线,要么实现了新的“艺术”性能。(Code可以在https://github.com/Fantabrous-J/Coref-Contistitateent-Graph)