Negative sampling (NS) loss plays an important role in learning knowledge graph embedding (KGE) to handle a huge number of entities. However, the performance of KGE degrades without hyperparameters such as the margin term and number of negative samples in NS loss being appropriately selected. Currently, empirical hyperparameter tuning addresses this problem at the cost of computational time. To solve this problem, we theoretically analyzed NS loss to assist hyperparameter tuning and understand the better use of the NS loss in KGE learning. Our theoretical analysis showed that scoring methods with restricted value ranges, such as TransE and RotatE, require appropriate adjustment of the margin term or the number of negative samples different from those without restricted value ranges, such as RESCAL, ComplEx, and DistMult. We also propose subsampling methods specialized for the NS loss in KGE studied from a theoretical aspect. Our empirical analysis on the FB15k-237, WN18RR, and YAGO3-10 datasets showed that the results of actually trained models agree with our theoretical findings.
翻译:负抽样(NS)损失在学习知识图嵌入(KGE)处理大量实体方面起着重要作用。然而,KGE的性能在没有超参数的情况下降解,没有适当选择NS损失的差值期和负抽样数量等超参数。目前,实验性超参数调整以计算时间为代价解决这个问题。为了解决这个问题,我们在理论上分析了NS损失,以协助超参数调整,并理解KGE学习中更好地使用NS损失。我们的理论分析表明,具有限制值范围的评分方法,如TransE和RotateE, 需要适当调整差值期或负样品数量,与RESCAL、ComplEx和DistMult等没有限制值范围的样品不同。我们还从理论角度为KGE研究的NS损失提出了专门分抽样方法。我们关于FB15k-237、WN18RRR和YAGO3-10的实证分析表明,实际培训模型的结果与我们的理论结论一致。