Debiased recommendation with a randomized dataset has shown promising results in mitigating system-induced biases. However, compared with the more well-studied route that does not use a randomized dataset, it still lacks theoretical insight and an ideal optimization objective function. To bridge this gap, we study the debiasing problem from a new perspective and propose to directly minimize the upper bound of an ideal objective function, which facilitates a better potential solution to system-induced biases. First, we formulate a new ideal optimization objective function with a randomized dataset. Second, according to the prior constraints that an adopted loss function may satisfy, we derive two different upper bounds of the objective function, i.e., a generalization error bound with the triangle inequality and a generalization error bound with the separability. Third, we show that most existing related methods can be regarded as insufficient optimizations of these two upper bounds. Fourth, we propose a novel method, called debiasing approximate upper bound with a randomized dataset (DUB), which achieves a more sufficient optimization of these upper bounds. Finally, we conduct extensive experiments on a public dataset and a real product dataset to verify the effectiveness of DUB.