Distant supervision (DS) aims to generate large-scale heuristic labeling corpus, which is widely used for neural relation extraction currently. However, it heavily suffers from noisy labeling and long-tail distributions problem. Many advanced approaches usually separately address two problems, which ignore their mutual interactions. In this paper, we propose a novel framework named RH-Net, which utilizes Reinforcement learning and Hierarchical relational searching module to improve relation extraction. We leverage reinforcement learning to instruct the model to select high-quality instances. We then propose the hierarchical relational searching module to share the semantics from correlative instances between data-rich and data-poor classes. During the iterative process, the two modules keep interacting to alleviate the noisy and long-tail problem simultaneously. Extensive experiments on widely used NYT data set clearly show that our method significant improvements over state-of-the-art baselines.
翻译:远程监管(DS)旨在产生大规模超链接标签,目前广泛用于神经关系提取,但严重受噪音标签和长尾分发问题的影响。许多先进方法通常分别解决两个问题,而这两个问题忽视了它们之间的相互作用。在本文件中,我们提议了一个名为RH-Net的新框架,它利用强化学习和等级关系搜索模块来改进关系提取。我们利用强化学习来指导模型选择高质量实例。然后我们提出等级关系搜索模块,以分享数据丰富和数据贫乏类别之间相关案例的语义。在迭接过程中,两个模块保持互动,以同时缓解噪音和长尾问题。关于广泛使用的NYT数据集的广泛实验清楚地表明,我们的方法大大改进了最先进的基线。