为关系采掘而进行模拟代表性学习 (Prototypical Representation Learning for Relation Extraction)

Recognizing relations between entities is a pivotal task of relational learning. Learning relation representations from distantly-labeled datasets is difficult because of the abundant label noise and complicated expressions in human language. This paper aims to learn predictive, interpretable, and robust relation representations from distantly-labeled data that are effective in different settings, including supervised, distantly supervised, and few-shot learning. Instead of solely relying on the supervision from noisy labels, we propose to learn prototypes for each relation from contextual information to best explore the intrinsic semantics of relations. Prototypes are representations in the feature space abstracting the essential semantics of relations between entities in sentences. We learn prototypes based on objectives with clear geometric interpretation, where the prototypes are unit vectors uniformly dispersed in a unit ball, and statement embeddings are centered at the end of their corresponding prototype vectors on the surface of the ball. This approach allows us to learn meaningful, interpretable prototypes for the final classification. Results on several relation learning tasks show that our model significantly outperforms the previous state-of-the-art models. We further demonstrate the robustness of the encoder and the interpretability of prototypes with extensive experiments.

翻译：承认实体之间的关系是关系学习的关键任务。从遥远标签的数据集学习关系表征是困难的,因为标签噪音和复杂的人类语言表达方式十分丰富。本文件的目的是从在不同环境中有效的远标签数据中学习预测、可解释和强有力的关系表征,这些数据在不同环境中有效,包括受监督、受远程监督的和少见的学习。我们建议,不要仅仅依靠噪音标签的监督,而要从背景信息中学习每种关系的原型,最好从背景信息中学习内在的语义。原型是空间中摘取各句中实体关系的基本语义的特征。我们学习基于明确几何解释目标的原型,在这些原型是单元球中统一分布的单位矢量,而声明嵌入的中心是球表面相应的原型矢量的结尾。这种方法使我们能够学习有意义的、可解释的终级分类原型。一些相关学习任务的结果显示,我们的模型明显超越了先前的状态模型的语义。我们进一步展示了原型的坚固性,并展示了原型的原型。我们进一步展示了原型的精准性。