In dialogue systems, the tasks of named entity recognition (NER) and named entity linking (NEL) are vital preprocessing steps for understanding user intent, especially in open domain interaction where we cannot rely on domain-specific inference. UCSC's effort as one of the funded teams in the 2017 Amazon Alexa Prize Contest has yielded Slugbot, an open domain social bot, aimed at casual conversation. We discovered several challenges specifically associated with both NER and NEL when building Slugbot, such as that the NE labels are too coarse-grained or the entity types are not linked to a useful ontology. Moreover, we have discovered that traditional approaches do not perform well in our context: even systems designed to operate on tweets or other social media data do not work well in dialogue systems. In this paper, we introduce Slugbot's Named Entity Recognition for dialogue Systems (SlugNERDS), a NER and NEL tool which is optimized to address these issues. We describe two new resources that we are building as part of this work: SlugEntityDB and SchemaActuator. We believe these resources will be useful for the research community.
翻译:在对话系统中,名称实体识别(NER)和名称实体链接(NEL)的任务对于理解用户意图至关重要的预处理步骤,特别是在我们无法依赖特定域的推断的开放域互动中。UCSC作为2017年亚马逊亚马逊亚历山大奖竞赛中受资助的团队之一所做的努力产生了Slugbot(开放域社会机器人),目的是随意交谈。我们在建造Slugbot时发现了一些具体与NER和名称链接(NEL)相关的若干挑战,例如NE标签过于粗糙,或者实体类型与有用的本体学没有联系。此外,我们发现传统方法在我们的环境下效果不佳:即使设计用于在推特上操作的系统或其他社会媒体数据在对话系统中也不起作用。在本文中,我们引入了Slugbot命名的“对话系统实体识别”(SlugNERDS),这是为解决这些问题而最优化的一种NER和NEL工具。我们描述作为这项工作的一部分正在建设的两种新资源:SluEnfentity DDB和Shomator。我们相信,这些资源将会成为这项工作的一部分。