State-of-the-art models for joint entity recognition and relation extraction strongly rely on external natural language processing (NLP) tools such as POS (part-of-speech) taggers and dependency parsers. Thus, the performance of such joint models depends on the quality of the features obtained from these NLP tools. However, these features are not always accurate for various languages and contexts. In this paper, we propose a joint neural model which performs entity recognition and relation extraction simultaneously, without the need of any manually extracted features or the use of any external tool. Specifically, we model the entity recognition task using a CRF (Conditional Random Fields) layer and the relation extraction task as a multi-head selection problem (i.e., potentially identify multiple relations for each entity). We present an extensive experimental setup, to demonstrate the effectiveness of our method using datasets from various contexts (i.e., news, biomedical, real estate) and languages (i.e., English, Dutch). Our model outperforms the previous neural models that use automatically extracted features, while it performs within a reasonable margin of feature-based neural models, or even beats them.
翻译:联合实体承认和关系提取的最新模型非常依赖外部自然语言处理工具,如POS(部分语音)标记器和依赖分析器等外部自然语言处理工具,因此,这些联合模型的性能取决于从这些NLP工具中获得的特征的质量。然而,这些特征对各种语言和背景并不总是准确的。在本文件中,我们提议了一个联合神经模型,该模型同时进行实体承认和关联提取,而不需要任何手工提取的特征或使用任何外部工具。具体地说,我们用通用报告格式(有条件随机字段)层和关系提取任务作为多头选择问题(即可能确定每个实体的多重关系)来模拟实体识别任务。我们提出一个广泛的实验设置,以展示我们使用不同背景(即新闻、生物医学、房地产)和语言(即英语、荷兰语)数据集的方法的有效性。我们的模型超越了以前使用自动提取特征的神经模型,而该模型在基于地貌模型的合理空间范围内运行,甚至击败这些模型。