This paper studies the effect of the order of depth of mention on nested named entity recognition (NER) models. NER is an essential task in the extraction of biomedical information, and nested entities are common since medical concepts can assemble to form larger entities. Conventional NER systems only predict disjointed entities. Thus, iterative models for nested NER use multiple predictions to enumerate all entities, imposing a predefined order from largest to smallest or smallest to largest. We design an order-agnostic iterative model and a procedure to choose a custom order during training and prediction. To accommodate for this task, we propose a modification of the Transformer architecture to take into account the entities predicted in the previous steps. We provide a set of experiments to study the model's capabilities and the effects of the order on performance. Finally, we show that the smallest to largest order gives the best results.
翻译:本文研究 " 深度 " 的提及顺序对嵌入名称实体识别(NER)模型的影响。 " NER " 是提取生物医学信息的一项基本任务, " 嵌入 " 实体是常见的,因为医学概念可以组装成较大的实体。 " 常规NER系统只能预测脱节的实体。 " 嵌入 " NER的迭代模型使用多种预测来罗列所有实体,从最大到最小或最小到最大的预先定义的顺序。我们设计了 " 定序 " 迭代模式,并设计了在培训和预测期间选择定制命令的程序。为了适应这一任务,我们建议对 " 变换 " 结构进行修改,以考虑到前几个步骤所预测的实体。我们提供了一套实验,以研究模型的能力和顺序对绩效的影响。最后,我们表明,最小到最大的顺序提供了最佳结果。