重新思考神经力与矩阵-因素-合作过滤:理论视角 (Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives)

The recent work by Rendle et al. (2020), based on empirical observations, argues that matrix-factorization collaborative filtering (MCF) compares favorably to neural collaborative filtering (NCF), and conjectures the dot product's superiority over the feed-forward neural network as similarity function. In this paper, we address the comparison rigorously by answering the following questions: 1. what is the limiting expressivity of each model; 2. under the practical gradient descent, to which solution does each optimization path converge; 3. how would the models generalize under the inductive and transductive learning setting. Our results highlight the similar expressivity for the overparameterized NCF and MCF as kernelized predictors, and reveal the relation between their optimization paths. We further show their different generalization behaviors, where MCF and NCF experience specific tradeoff and comparison in the transductive and inductive collaborative filtering setting. Lastly, by showing a novel generalization result, we reveal the critical role of correcting exposure bias for model evaluation in the inductive setting. Our results explain some of the previously observed conflicts, and we provide synthetic and real-data experiments to shed further insights to this topic.

翻译：根据经验观察,Rendle等人(2020年)最近根据经验观察开展的工作认为,矩阵因素合作过滤(MCF)与神经合作过滤(NCF)相比有利,并推测点产品相对于进进取神经网络的优势是相似功能。在本文件中,我们严谨地处理这一比较,回答下列问题:1. 每种模型的表达度是有限的;2. 在实际的梯度下,每种优化路径都能找到解决办法;3. 模型如何在进化和转导学习设置下概括化。我们的结果突出了过度量化的NCF和MCF作为内向型预测器的类似表达性,并揭示了它们之间的优化路径关系。我们进一步展示了它们不同的概括行为,即MCFF和NCF在转导和感化协作过滤设置中经历了具体的交换和比较。最后,通过展示新的概括性结果,我们揭示了在进化环境中纠正暴露暴露偏向模式评估的关键作用。我们的成果解释了以往观察到的冲突的一些初步认识,我们提供了合成和真实的实验。

相关内容

协同过滤

关注 224

协同过滤（英语：Collaborative Filtering），简单来说是利用某兴趣相投、拥有共同经验之群体的喜好来推荐用户感兴趣的信息，个人透过合作的机制给予信息相当程度的回应（如评分）并记录下来以达到过滤的目的进而帮助别人筛选信息，回应不一定局限于特别感兴趣的，特别不感兴趣信息的纪录也相当重要。协同过滤又可分为评比（rating）或者群体过滤（social filtering）。其后成为电子商务当中很重要的一环，即根据某顾客以往的购买行为以及从具有相似购买行为的顾客群的购买行为去推荐这个顾客其“可能喜欢的品项”，也就是借由社群的喜好提供个人化的信息、商品等的推荐服务。除了推荐之外，近年来也发展出数学运算让系统自动计算喜好的强弱进而去芜存菁使得过滤的内容更有依据，也许不是百分之百完全准确，但由于加入了强弱的评比让这个概念的应用更为广泛，除了电子商务之外尚有信息检索领域、网络个人影音柜、个人书架等的应用等。

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日