We introduce a multi-armed bandit model where the reward is a sum of multiple random variables, and each action alters the distributions of only some of them. After each action, the agent observes the realizations of all the variables. This model is motivated by marketing campaigns and recommender systems, where the variables represent outcomes on individual customers, such as clicks. We propose UCB-style algorithms that estimate the uplifts of the actions over a baseline. We study multiple variants of the problem, including when the baseline and the affected variables are unknown, and prove sublinear regret bounds for all of them. We also provide lower bounds that justify the necessity of our modeling assumptions. Experiments on synthetic and real-world datasets show the benefit of methods that estimate the uplifts over policies that ignore this structure.
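To make the setting concrete, below is a minimal sketch of a UCB-style uplift bandit consistent with the model described above, not the paper's actual algorithm. The synthetic instance, all names (`pull`, `baseline`, `affected`), and the choice of confidence width are illustrative assumptions; in particular, the width here scales with the size of an action's affected subset rather than with the total number of variables, which is the structural advantage the abstract alludes to.

```python
import numpy as np

rng = np.random.default_rng(0)

K = 50           # number of outcome variables (e.g. individual customers)
n_actions = 5
T = 5000

# Hypothetical synthetic instance: Bernoulli variables with baseline means;
# each action shifts the means of a small, known subset of 5 variables.
baseline = rng.uniform(0.2, 0.5, size=K)
affected = [rng.choice(K, size=5, replace=False) for _ in range(n_actions)]
shift = rng.uniform(-0.1, 0.2, size=(n_actions, 5))

def pull(a):
    """Draw all K variables; action a shifts only its affected subset."""
    means = baseline.copy()
    means[affected[a]] = np.clip(means[affected[a]] + shift[a], 0, 1)
    return rng.binomial(1, means)

# UCB on uplifts: per action, track empirical means of its affected variables
# under that action. Baseline means are estimated from every round, because
# variables outside the pulled action's subset always follow the baseline.
counts = np.zeros(n_actions)
sum_affected = [np.zeros(5) for _ in range(n_actions)]
base_sum = np.zeros(K)
base_cnt = np.zeros(K)

for t in range(1, T + 1):
    if t <= n_actions:
        a = t - 1                       # pull each action once to initialize
    else:
        ucb = np.empty(n_actions)
        for b in range(n_actions):
            # Uplift estimate: sum over affected variables of
            # (mean under action b) - (baseline mean).
            mu_b = sum_affected[b] / counts[b]
            mu_0 = base_sum[affected[b]] / np.maximum(base_cnt[affected[b]], 1)
            # Confidence width scales with the affected-subset size, not K.
            bonus = len(affected[b]) * np.sqrt(2 * np.log(t) / counts[b])
            ucb[b] = (mu_b - mu_0).sum() + bonus
        a = int(np.argmax(ucb))

    x = pull(a)
    counts[a] += 1
    sum_affected[a] += x[affected[a]]
    # Variables not affected by the pulled action are free baseline samples.
    mask = np.ones(K, dtype=bool)
    mask[affected[a]] = False
    base_sum[mask] += x[mask]
    base_cnt[mask] += 1

# The action with the largest true uplift should dominate the pull counts.
print("pull counts per action:", counts.astype(int))
```

The key design choice this sketch illustrates is that every round yields baseline samples for all unaffected variables, so the baseline is learned essentially for free, and the per-action uncertainty is confined to its small affected subset.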