Item recommendation based on historical user-item interactions is of vital importance for web-based services. However, the data used to train a recommender system (RS) suffers from severe popularity bias. Interactions of a small fraction of popular (head) items account for almost the whole training data. Normal training methods from such biased data tend to repetitively generate recommendations from the head items, which further exacerbates the data bias and affects the exploration of potentially interesting items from niche (tail) items. In this paper, we explore the central theme of long-tail recommendation. Through an empirical study, we find that head items are very likely to be recommended due to the fact that the gradients coming from head items dominate the overall gradient update process, which further affects the optimization of tail items. To this end, we propose a general framework namely Item Cluster-Wise Multi-Objective Training (ICMT) for long-tail recommendation. Firstly, the disentangled representation learning is utilized to identify the popularity impact behind user-item interactions. Then item clusters are adaptively formulated according to the disentangled popularity representation. After that, we consider the learning over the whole training data as a weighted aggregation of multiple item cluster-wise objectives, which can be resolved through a Pareto-Efficient solver for a harmonious overall gradient direction. Besides, a contractive loss focusing on model robustness is derived as a regularization term. We instantiate ICMT with three state-of-the-art recommendation models and conduct experiments on three real-world datasets. %Through alleviating the popularity bias, Experimental results demonstrate that the proposed ICMT significantly improves the overall recommendation performance, especially on tail items.
翻译:基于历史用户-项目互动关系的正常培训方法往往会重复产生基于历史用户-项目的建议,这进一步加剧了数据偏差,并影响从利基(零售)项目中探索潜在有趣项目。在本文中,我们探讨了长尾建议的核心主题。通过经验性研究,我们发现,用于培训推荐者系统(RS)的数据极有可能被推荐,因为来自头项的梯度占整个梯度更新过程的主导度,这进一步影响到尾项的优化。为此,我们提议了一个总框架,即“集群-Wise多动性培训(ICMT)”项目,用于长尾建议,这进一步加剧了数据偏重数据偏重,从而影响从利基(零售)项目对潜在有趣项目的探索。我们探讨了长尾建议的核心主题。然后,通过经验性研究,我们发现由于头项的梯度决定了整个梯度更新过程的梯度,从而进一步影响到尾项的优化过程。为此,我们考虑整个培训过程的推算结果的推算结果会大大提升了总体递升级数据,然后,我们将整个C级数据推算出一个直地标的递增级数据,从而解决了多级数据。