To alleviate human efforts from obtaining large-scale annotations, Semi-Supervised Relation Extraction methods aim to leverage unlabeled data in addition to learning from limited samples. Existing self-training methods suffer from the gradual drift problem, where noisy pseudo labels on unlabeled data are incorporated during training. To alleviate the noise in pseudo labels, we propose a method called MetaSRE, where a Relation Label Generation Network generates quality assessment on pseudo labels by (meta) learning from the successful and failed attempts on Relation Classification Network as an additional meta-objective. To reduce the influence of noisy pseudo labels, MetaSRE adopts a pseudo label selection and exploitation scheme which assesses pseudo label quality on unlabeled samples and only exploits high-quality pseudo labels in a self-training fashion to incrementally augment labeled samples for both robustness and accuracy. Experimental results on two public datasets demonstrate the effectiveness of the proposed approach.
翻译:为了减轻假标签中的噪音,我们建议一种名为MetaSRE的方法,即“Relation Label Group”网络通过(meta)学习成功和失败的关系分类网络,对假标签进行质量评估,作为额外的元目标。为了减少杂音假标签的影响,MetaSRE采用了一种假标签选择和开发办法,评估未贴标签样品上的假标签质量,并且只利用高品质的假标签进行自我培训,以逐步增加标签样本,使其具有稳健性和准确性。两个公共数据集的实验结果显示了拟议方法的有效性。