Properly handling missing data is a fundamental challenge in recommendation. Most present works perform negative sampling from unobserved data to supply the training of recommender models with negative signals. Nevertheless, existing negative sampling strategies, either static or adaptive ones, are insufficient to yield high-quality negative samples --- both informative to model training and reflective of user real needs. In this work, we hypothesize that item knowledge graph (KG), which provides rich relations among items and KG entities, could be useful to infer informative and factual negative samples. Towards this end, we develop a new negative sampling model, Knowledge Graph Policy Network (KGPolicy), which works as a reinforcement learning agent to explore high-quality negatives. Specifically, by conducting our designed exploration operations, it navigates from the target positive interaction, adaptively receives knowledge-aware negative signals, and ultimately yields a potential negative item to train the recommender. We tested on a matrix factorization (MF) model equipped with KGPolicy, and it achieves significant improvements over both state-of-the-art sampling methods like DNS and IRGAN, and KG-enhanced recommender models like KGAT. Further analyses from different angles provide insights of knowledge-aware sampling. We release the codes and datasets at https://github.com/xiangwang1223/kgpolicy.
翻译:正确处理缺失的数据是建议方面的一个基本挑战。多数目前的工作都从未观察的数据中进行负面抽样,以提供负面信号来培训建议者模型;然而,现有的消极抽样战略,无论是静态还是适应性战略,都不足以产生高质量的负面样本 -- -- 信息丰富,可以进行模拟培训,反映用户的实际需求。在这项工作中,我们假设提供各项目和KG实体之间丰富关系的物品知识图(KG)可能有益于推断信息性和事实负面样本。为此,我们开发了新的负面抽样模型,即知识图表政策网(KGPolicy),作为强化学习工具,探索高质量的负面数据。具体来说,通过开展我们设计的勘探行动,它从目标的积极互动中导航,适应性地接收知识认知负面信号,并最终产生潜在负面项目来培训建议者。我们测试了一个配有KGPL政策模型的矩阵化(MF)模型,并在诸如DNS和IRGAN以及KG-enchanceed 等州级抽样方法方面都取得了显著的改进。我们从KAAT/Sangximing sexionals exional developments。