Modeling user interest accurately is crucial to recommendation systems. Existing works capture user interest from historical behaviors. Due to the sparsity and noise in user behavior data, behavior based models learn incomplete and sometimes inaccurate preference patterns and easily suffer from the cold user problem. To address these issues, we propose a social graph enhanced framework for behavior based models, namely Social4Rec. The social graph involving multiple relation types is extracted to find users with similar interests. It is challenging due to the trivial and sparse relations in social graph. Specifically, we first propose a Cluster-Calibrate-Merge network (CCM) to discover interest groups satisfying three properties: intrinsic self-organizing patterns through cluster layer, robustness to sparse relations through knowledge distillation of calibrator layer. We then use the averaged user interest representation within each group from CCM to complete each user behavior embedding and obtain relation specific interest aware embedding. To alleviate the trivial relation problem, relation specific interest aware embedding are aggregated among relation types through attention mechanism to obtain the interest aware social embedding for each user. It is combined with user behavior embedding to derive the matching score between the user and item. Both offline and online experiments on Tencent Video, which is one of the biggest video recommendation platforms with nearly one billion users in China, demonstrate the superiority of our work, especially for cold users. The codes are available on https://github.com/Social4Rec/SocialN
翻译:为解决这些问题,我们提议了一个基于行为模式的社会图表强化框架,即Social4Rec。 涉及多种关系类型的社会图表被提取,以找到具有类似利益的用户。由于社会图表中的微小和稀薄关系,这具有挑战性。具体地说,我们首先提议建立一个集群-网络(CCM),以发现满足以下三个属性的利益群体:通过集群层的内在自我组织模式,通过知识蒸馏校准层对稀薄关系保持稳健。然后我们使用CCM中每个群体的平均用户兴趣代表,以完成每个用户的嵌入行为,并获得特定兴趣嵌入关系。为了缓解细小关系问题,通过关注机制将特定兴趣嵌入关系在关系类型中进行汇总,以获得每个用户对社交嵌入的兴趣。它与用户嵌入三个属性:通过集群层的内在自我组织模式,通过校准校准校准校准层的知识,强度与稀薄关系关系关系。 我们随后使用CMM公司每个群体中的平均用户利益代表度, 在线实验中最接近10亿个用户。