Many natural language processing tasks require dealing with Named Entities (NEs) in the texts themselves and sometimes also in external knowledge sources. While this is often easy for humans, recent neural methods that rely on learned word embeddings for NLP tasks have difficulty with it, especially with out of vocabulary or rare NEs. In this paper, we propose a new neural method for this problem, and present empirical evaluations on a structured Question-Answering task, three related Goal-Oriented dialog tasks and a reading-comprehension-based task. They show that our proposed method can be effective in dealing with both in-vocabulary and out of vocabulary (OOV) NEs. We create extended versions of dialog bAbI tasks 1,2 and 4 and Out-of-vocabulary (OOV) versions of the CBT test set which will be made publicly available online.
翻译:许多自然语言处理任务都需要在文本中与命名实体打交道,有时还需要在外部知识来源中与命名实体打交道。虽然这对于人类来说往往很容易,但最近依赖为NLP任务学习的单词嵌入的神经方法有困难,特别是用词汇或稀有的NE。 在本文中,我们提出了解决这一问题的新神经方法,并对结构化的提问任务、三个相关的目标导向式对话任务和阅读综合任务提出经验性评价。它们表明,我们提出的方法可以有效地处理词汇和词汇(OOOV) NE。我们创建了BAbI任务1和4的扩大版本,以及CBBT测试集的外语(OOOV)版本,这些版本将在线公布。