Relation extraction has the potential for large-scale knowledge graph construction, but current methods do not consider the qualifier attributes for each relation triplet, such as time, quantity or location. The qualifiers form hyper-relational facts which better capture the rich and complex knowledge graph structure. For example, the relation triplet (Leonard Parker, Educated At, Harvard University) can be factually enriched by including the qualifier (End Time, 1967). Hence, we propose the task of hyper-relational extraction to extract more specific and complete facts from text. To support the task, we construct HyperRED, a large-scale and general-purpose dataset. Existing models cannot perform hyper-relational extraction as it requires a model to consider the interaction between three entities. Hence, we propose CubeRE, a cube-filling model inspired by table-filling approaches and explicitly considers the interaction between relation triplets and qualifiers. To improve model scalability and reduce negative class imbalance, we further propose a cube-pruning method. Our experiments show that CubeRE outperforms strong baselines and reveal possible directions for future research. Our code and data are available at github.com/declare-lab/HyperRED.
翻译:大规模知识图表的构造有可能具有大规模的关系提取潜力,但目前的方法并不考虑每个关系三重的限定性属性,例如时间、数量或位置。限定性词形成超关系事实,更好地捕捉丰富和复杂的知识图表结构。例如,三重关系(Leonard Parker, 哈佛大学教育的At At)可以通过包含限定性来实际丰富。因此,我们提议超关系提取任务,以便从文本中提取更具体和完整的事实。为了支持这项任务,我们建造了一个大型和通用的超关系数据集。现有的模型无法进行超关系提取,因为它需要一个模型来考虑三个实体之间的互动。因此,我们提出CubeRE,一个受填表方法启发的立方填充模型,并明确考虑三重关系和定性者之间的互动。为了改进模型的可缩放性和减少负级失衡,我们进一步提议一个立方体运行方法。我们的实验显示, CubeRE 超越 brubrbs 和显示未来研究的可能方向。