Existing methods in relation extraction have leveraged the lexical features in the word sequence and the syntactic features in the parse tree. Though effective, the lexical features extracted from the successive word sequence may introduce some noise that has little or no meaningful content. Meanwhile, the syntactic features are usually encoded via graph convolutional networks which have restricted receptive field. To address the above limitations, we propose a multi-scale feature and metric learning framework for relation extraction. Specifically, we first develop a multi-scale convolutional neural network to aggregate the non-successive mainstays in the lexical sequence. We also design a multi-scale graph convolutional network which can increase the receptive field towards specific syntactic roles. Moreover, we present a multi-scale metric learning paradigm to exploit both the feature-level relation between lexical and syntactic features and the sample-level relation between instances with the same or different classes. We conduct extensive experiments on three real world datasets for various types of relation extraction tasks. The results demonstrate that our model significantly outperforms the state-of-the-art approaches.
翻译:相关提取的现有方法已利用了单词序列中的词汇特征和剖析树中的合成特征。尽管这些词汇特征是有效的,但从连续的单词序列中提取的词汇特征可能会带来一些没有实际内容或没有实际内容的噪音。与此同时,这些合成特征通常通过图形共进网络编码,这些网络限制了可接受字段。为了解决上述局限性,我们提议了一个多尺度特征和衡量学习框架,用于相关提取。具体地说,我们首先开发了一个多尺度的革命神经网络,以汇总词汇序列中的非继承支柱。我们还设计了一个多尺度的图形共进化网络,可以将接收字段扩大到特定的合成作用。此外,我们提出了一个多尺度的参数学习模式,既利用地貌特征与合成特征之间的关系,又利用与相同或不同类别之间样本级的关系。我们对三种真实的世界数据集进行了广泛的实验,用于各种类型的关联提取任务。结果表明,我们的模型明显地超越了状态方法。