Personalised interactive systems, such as recommender systems, require selecting relevant items depending on context. Production systems need to identify these items rapidly from very large catalogues, a task that can be solved efficiently with maximum inner product search technology. Offline optimisation of maximum inner product search can be achieved by relaxing the discrete problem, resulting in policy learning or REINFORCE-style learning algorithms. Unfortunately, this relaxation step requires computing a sum over the entire catalogue, making the evaluation of the gradient (and hence each stochastic gradient descent iteration) linear in the catalogue size. This calculation is untenable in many real-world settings, such as large-catalogue recommender systems, severely limiting the usefulness of the method in practice. In this paper we show how to produce an excellent approximation of these policy learning algorithms that scales logarithmically with the catalogue size. Our contribution is based on combining three novel ideas: a new Monte Carlo estimate of the gradient of a policy, the self-normalised importance sampling estimator, and the use of fast maximum inner product search at training time. Extensive experiments show our algorithm is an order of magnitude faster than naive approaches yet produces equally good policies.
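To make the complexity argument concrete, the following is a minimal sketch (not the paper's implementation) of a softmax policy over a large catalogue and two ways of computing the gradient of log pi(a|u): the exact version, whose expectation term sums over the whole catalogue, and a self-normalised importance sampling estimate computed from a small set of candidate items. The names (catalogue_size, embed_dim, n_samples) are illustrative, and the uniform proposal stands in for candidates that, in the paper's setting, would come from a fast maximum inner product search index.

```python
import numpy as np

# Illustrative sketch: softmax policy pi(b|u) ∝ exp(<x_u, v_b>) over a catalogue.
# The REINFORCE-style gradient of log pi(a|u) w.r.t. the user vector is
# v_a - E_{b~pi}[v_b]; the expectation is the O(catalogue) bottleneck.

rng = np.random.default_rng(0)
catalogue_size, embed_dim = 200_000, 16          # hypothetical sizes
item_vectors = rng.normal(size=(catalogue_size, embed_dim)) / np.sqrt(embed_dim)
user_vector = rng.normal(size=embed_dim)

scores = item_vectors @ user_vector              # s_b = <x_u, v_b>
action = int(np.argmax(scores))                  # some chosen/logged item

# Exact gradient: full softmax over the catalogue (linear cost per step).
probs = np.exp(scores - scores.max())
probs /= probs.sum()
grad_exact = item_vectors[action] - probs @ item_vectors

# Self-normalised importance sampling estimate of E_pi[v_b] from a small
# candidate set drawn from a uniform proposal q(b) = 1/catalogue_size.
n_samples = 2_000
cand = rng.integers(0, catalogue_size, size=n_samples)
log_w = scores[cand]                             # log(exp(s_b)/q) = s_b + const
w = np.exp(log_w - log_w.max())
w /= w.sum()                                     # self-normalisation
grad_snis = item_vectors[action] - w @ item_vectors[cand]

cos = grad_exact @ grad_snis / (np.linalg.norm(grad_exact) * np.linalg.norm(grad_snis))
print(f"cosine similarity of exact vs SNIS gradient: {cos:.4f}")
```

The self-normalisation step means the proposal probabilities only need to be known up to a constant, which is what makes replacing the uniform proposal with candidates retrieved by an approximate maximum inner product search index plausible; the exact retrieval-based estimator used in the paper is not reproduced here.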