Recent entity and relation extraction works focus on investigating how to obtain a better span representation from the pre-trained encoder. However, a major limitation of existing works is that they ignore the interrelation between spans (pairs). In this work, we propose a novel span representation approach, named Packed Levitated Markers (PL-Marker), to consider the interrelation between the spans (pairs) by strategically packing the markers in the encoder. In particular, we propose a neighborhood-oriented packing strategy, which considers the neighbor spans integrally to better model the entity boundary information. Furthermore, for those more complicated span pair classification tasks, we design a subject-oriented packing strategy, which packs each subject and all its objects to model the interrelation between the same-subject span pairs. The experimental results show that, with the enhanced marker feature, our model advances baselines on six NER benchmarks, and obtains a 4.1%-4.3% strict relation F1 improvement with higher speed over previous state-of-the-art models on ACE04 and ACE05.
翻译:最近的实体和关系提取工程侧重于调查如何从经过培训的编码器中获得更好的跨度代表。然而,现有工程的一个主要局限性是它们忽略了跨线(pairs)之间的相互关系。在这项工作中,我们提议采用新的跨线代表法,名为包装脱衣标记(PL-marker),通过在编码器中战略性地包装标记,考虑跨线(pairs)之间的相互关系。特别是,我们建议采用以邻为主的包装战略,即认为邻为整体的跨度,以更好地模拟实体边界信息。此外,对于那些更为复杂的跨线分类任务,我们设计了一个面向主题的包装战略,将每个对象及其所有对象包装起来,以模拟同一对象的跨线对之间的相互关系。实验结果显示,有了强化标记特征,我们六个净基准的模型推进基线实现了4.1%-4.3%的严格关系F1改进,其速度高于以往关于ACE04和ACE05的先进模型。