Collaborative Metric Learning (CML) has recently emerged as a popular method in recommendation systems (RS), closing the gap between metric learning and Collaborative Filtering. Following the convention of RS, existing methods exploit unique user representation in their model design. This paper focuses on a challenging scenario where a user has multiple categories of interests. Under this setting, we argue that the unique user representation might induce preference bias, especially when the item category distribution is imbalanced. To address this issue, we propose a novel method called \textit{Diversity-Promoting Collaborative Metric Learning} (DPCML), with the hope of considering the commonly ignored minority interest of the user. The key idea behind DPCML is to include a multiple set of representations for each user in the system. Based on this embedding paradigm, user preference toward an item is aggregated from different embeddings by taking the minimum item-user distance among the user embedding set. Furthermore, we observe that the diversity of the embeddings for the same user also plays an essential role in the model. To this end, we propose a \textit{diversity control regularization} term to accommodate the multi-vector representation strategy better. Theoretically, we show that DPCML could generalize well to unseen test data by tackling the challenge of the annoying operation that comes from the minimum value. Experiments over a range of benchmark datasets speak to the efficacy of DPCML.
翻译:合作计量学习(CML)最近成为建议系统的一种流行方法,缩小了标准学习与协作过滤之间的差距。在塞族共和国公约之后,现有方法在模型设计中利用了独特的用户代表。本文件侧重于用户具有多种兴趣的具有挑战性的情景。在这一背景下,我们争辩说,独特的用户代表制可能会引发偏向偏向,特别是在项目类别分布不平衡的情况下。为了解决这一问题,我们提议了一种名为\ textit{多样性-促进协作计量学习}(DPCL)的新方法,希望考虑到用户通常忽略的少数兴趣。DPCML背后的主要想法是,为系统中的每个用户提供一套多重代表。基于这种嵌入式,用户对一个项目的偏好通过不同嵌入式来汇总,办法是在用户嵌入的分类中采用最小的项目用户距离。此外,我们注意到,同一用户的嵌入方式在模型中也发挥着必不可少的作用。为此,我们提议,从一个“生物多样性控制”固定化的术语到系统内每个用户的多层次数据代表制中,可以更好地体现一个多层次的数据代表制。