This paper presents a novel framework, MGNER, for Multi-Grained Named Entity Recognition where multiple entities or entity mentions in a sentence could be non-overlapping or totally nested. Different from traditional approaches regarding NER as a sequential labeling task and annotate entities consecutively, MGNER detects and recognizes entities on multiple granularities: it is able to recognize named entities without explicitly assuming non-overlapping or totally nested structures. MGNER consists of a Detector that examines all possible word segments and a Classifier that categorizes entities. In addition, contextual information and a self-attention mechanism are utilized throughout the framework to improve the NER performance. Experimental results show that MGNER outperforms current state-of-the-art baselines up to 4.4% in terms of the F1 score among nested/non-overlapping NER tasks.
翻译:本文为多重命名实体识别提供了一个新的框架MGNER, 其中多个实体或实体在句子中提及的内容可以是非重叠或完全嵌套的。与将净入学率作为连续标签任务和连续批注实体的传统方法不同,MGNER检测和识别多个颗粒体的实体:它可以在没有明确假定非重叠或完全嵌套结构的情况下识别被点名的实体。MGNER包括一个检查所有可能的单词部分的检测器和一个对实体进行分类的分类器。此外,整个框架都利用了背景信息和自留机制来改进净入学率的绩效。实验结果显示,MGNER在嵌套/非重叠净入学率任务中比目前最先进的基线高出4.4%的F1分。