Deep neural models for low-resource named entity recognition (NER) have shown impressive results by leveraging distant super-vision or other meta-level information (e.g. explanation). However, the costs of acquiring such additional information are generally prohibitive, especially in domains where existing resources (e.g. databases to be used for distant supervision) may not exist. In this paper, we present a novel two-stage framework (AutoTriggER) to improve NER performance by automatically generating and leveraging "entity triggers" which are essentially human-readable clues in the text that can help guide the model to make better decisions. Thus, the framework is able to both create and leverage auxiliary supervision by itself. Through experiments on three well-studied NER datasets, we show that our automatically extracted triggers are well-matched to human triggers, and AutoTriggER improves performance over a RoBERTa-CRFarchitecture by nearly 0.5 F1 points on average and much more in a low resource setting.
翻译:低资源命名实体识别(NER)的深神经模型通过利用远程超视或其他元级信息(例如解释),显示了令人印象深刻的成果。然而,获取这类额外信息的成本一般都令人望而却步,特别是在现有资源(例如用于远程监督的数据库)可能不存在的领域。在本文中,我们提出了一个创新的两阶段框架(AutoTrigger),通过自动生成和利用“实体触发器”来改善净化性能,这些“实体触发器”基本上是可以人类阅读的线索,可以帮助指导模型做出更好的决定。因此,该框架既能够创建又能够自行利用辅助性监督。通过三个研究周密的NER数据集的实验,我们表明自动提取的触发器与人类触发器非常匹配,AutoTrigger在低资源环境下平均和多得多地提高0.5个F1点的RobERTA-CRFArgicturation的性能。