In an era of information explosion, recommendation systems play an important role in people's daily life by facilitating content exploration. It is known that user activeness, i.e., number of behaviors, tends to follow a long-tail distribution, where the majority of users are with low activeness. In practice, we observe that tail users suffer from significantly lower-quality recommendation than the head users after joint training. We further identify that a model trained on tail users separately still achieve inferior results due to limited data. Though long-tail distributions are ubiquitous in recommendation systems, improving the recommendation performance on the tail users still remains challenge in both research and industry. Directly applying related methods on long-tail distribution might be at risk of hurting the experience of head users, which is less affordable since a small portion of head users with high activeness contribute a considerate portion of platform revenue. In this paper, we propose a novel approach that significantly improves the recommendation performance of the tail users while achieving at least comparable performance for the head users over the base model. The essence of this approach is a novel Gradient Aggregation technique that learns common knowledge shared by all users into a backbone model, followed by separate plugin prediction networks for the head users and the tail users personalization. As for common knowledge learning, we leverage the backward adjustment from the causality theory for deconfounding the gradient estimation and thus shielding off the backbone training from the confounder, i.e., user activeness. We conduct extensive experiments on two public recommendation benchmark datasets and a large-scale industrial datasets collected from the Alipay platform. Empirical studies validate the rationality and effectiveness of our approach.
翻译:在信息爆炸时代,建议系统通过便利内容的探索,在人们日常生活中发挥重要作用。众所周知,用户的积极性,即行为数量,往往遵循长尾分发,而大多数用户的活跃程度较低。在实践中,我们观察到尾端用户在联合培训后受到的推荐质量比主用户低得多。我们进一步发现,由于数据有限,在尾端用户中单独培训的模型仍然取得低劣的结果。虽然建议系统中长尾用户的分布普遍存在理性,但提高尾尾端用户的建议性能仍然是研究和行业两方面的挑战。在长尾分发方面直接采用相关方法可能会损害主用户的经验,因为大多数用户的活跃程度较低。在实践中,我们发现尾端用户比主用户得到的建议性能要低得多,同时在基础模型中为主用户取得至少可比的绩效。这一方法的精髓是新型的 GradiGargregation 技术,在远程分发时,直接应用相关方法可能会损害主端用户的经验,而这种方法则不太负担得起。因此,由于高端用户对平台进行深层次分析,因此,我们从普通的用户对正尾部用户进行一项深度分析研究,我们从基础学习了内部分析,然后学习。