Bi-encoder architectures for distantly-supervised relation extraction are designed to make use of the complementary information found in text and knowledge graphs (KG). However, current architectures suffer from two drawbacks. They either do not allow any sharing between the text encoder and the KG encoder at all, or, in case of models with KG-to-text attention, only share information in one direction. Here, we introduce cross-stitch bi-encoders, which allow full interaction between the text encoder and the KG encoder via a cross-stitch mechanism. The cross-stitch mechanism allows sharing and updating representations between the two encoders at any layer, with the amount of sharing being dynamically controlled via cross-attention-based gates. Experimental results on two relation extraction benchmarks from two different domains show that enabling full interaction between the two encoders yields strong improvements.
翻译:用于远程监督关系提取的双编码器结构的设计,是为了利用文本和知识图形(KG)中发现的补充信息。然而,目前的结构有两个缺点。要么不允许文本编码器和KG编码器之间进行任何共享,要么在KG到文字注意的模型中,只共享一个方向的信息。这里,我们引入了跨丝串双编码器,通过跨丝滴机制使文本编码器和KG编码器充分互动。跨丝串机制允许两个编码器在任何一层之间共享和更新表达方式,而共享的程度则通过跨注意大门动态控制。两个不同领域的两个关系提取基准的实验结果显示,使两个编码器之间能够全面互动,可以产生很大的改进。