上下文强盗,并给予具体奖赏,适用公平排名 (Contextual bandits with concave rewards, and an application to fair ranking) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 上下文赌博机/上下文老虎机 · Facebook AI Research · 秩 · CASE ·

2022 年 10 月 18 日

Contextual bandits with concave rewards, and an application to fair ranking

翻译：上下文强盗,并给予具体奖赏,适用公平排名

Virginie Do,Elvis Dohmatob,Matteo Pirotta,Alessandro Lazaric,Nicolas Usunier

We consider Contextual Bandits with Concave Rewards (CBCR), a multi-objective bandit problem where the desired trade-off between the rewards is defined by a known concave objective function, and the reward vector depends on an observed stochastic context. We present the first algorithm with provably vanishing regret for CBCR without restrictions on the policy space, whereas prior works were restricted to finite policy spaces or tabular representations. Our solution is based on a geometric interpretation of CBCR algorithms as optimization algorithms over the convex set of expected rewards spanned by all stochastic policies. Building on Frank-Wolfe analyses in constrained convex optimization, we derive a novel reduction from the CBCR regret to the regret of a scalar-reward bandit problem. We illustrate how to apply the reduction off-the-shelf to obtain algorithms for CBCR with both linear and general reward functions, in the case of non-combinatorial actions. Motivated by fairness in recommendation, we describe a special case of CBCR with rankings and fairness-aware objectives, leading to the first algorithm with regret guarantees for contextual combinatorial bandits with fairness of exposure.

翻译：我们考虑的是Concave Rewards(CBCR)的“背景强盗”问题,这是一个多目标的强盗问题,其原因是,通过已知的 concave 目标功能界定了奖赏之间的预期权衡,而奖赏矢量则取决于观察到的随机环境。我们提出了第一个算法,在没有限制政策空间的情况下,CBCR对CBR的遗憾可以明显消失,而以前的工作仅限于有限的政策空间或表示。我们的解决办法是基于对CBCR算法的几何解释,将CBCR算法作为所有随机政策的预期奖赏范围组合的优化算法。在限制 convex优化的Frank-Wolfe分析的基础上,我们从CBCR得出了新颖的减法,结果就是对Scalar-Reward 土匪问题的遗憾。我们演示了如何将现在的减法用于获得CBCRCRCR的算法,在非combinal 行动方面都有线性和一般奖赏功能。受建议公平激励,我们描述了CBCCCRCRICCRind-awa relaveal laveal asim vial lagal sh

0

相关内容

赌博机/老虎机

赌博机/老虎机

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

纳颗粒-纳结构复合SERS基底的生化辅助可控制造及性能增强方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Orexin/OX1R激动FOXO1/Atg7干预胰岛β细胞自噬的机制及其在胰岛功能缺陷中的意义

国家自然科学基金

0+阅读 · 2014年12月31日

基于PCA与二代Curvelet变换的多模态医学图像融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

炎症相关miRNA 及其靶序列SNPs 与胃癌发生的分子流行病学研究

国家自然科学基金

0+阅读 · 2012年12月31日

寻找多氯联苯代谢途径中缺失的一环

国家自然科学基金

0+阅读 · 2009年12月31日

MiRNA基因遗传变异与膀胱癌易感性及其分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

木质素酶-半纤维素酶多酶体系优化构建及其与产纤维素酶真菌协同发酵木质纤维素

国家自然科学基金

0+阅读 · 2009年12月31日

口腔白斑癌变的miRNA表达模型

国家自然科学基金

0+阅读 · 2008年12月31日

The Adversary Bound Revisited: From Optimal Query Algorithms to Optimal Control

Arxiv

0+阅读 · 2022年11月29日

On Learning Fairness and Accuracy on Multiple Subgroups

Arxiv

0+阅读 · 2022年11月29日

Quasi-stable Coloring for Graph Compression: Approximating Max-Flow, Linear Programs, and Centrality

Arxiv

0+阅读 · 2022年11月29日

Personalized Reward Learning with Interaction-Grounded Learning (IGL)

Arxiv

0+阅读 · 2022年11月28日

Minimax AUC Fairness: Efficient Algorithm with Provable Convergence

Arxiv

0+阅读 · 2022年11月28日

An FMM Accelerated Poisson Solver for Complicated Geometries in the Plane using Function Extension

Arxiv

0+阅读 · 2022年11月26日

On the Re-Solving Heuristic for (Binary) Contextual Bandits with Knapsacks

Arxiv

0+阅读 · 2022年11月25日

Bayesian Learning for Neural Networks: an algorithmic survey

Bayesian Learning for Neural Networks: an algorithmic survey

Arxiv

0+阅读 · 2022年11月24日

FairFed: Enabling Group Fairness in Federated Learning

Arxiv

0+阅读 · 2022年11月23日

Learning with Differentiable Algorithms

Arxiv

11+阅读 · 2022年9月1日

VIP会员

文章信息

相关主题

赌博机/老虎机

上下文赌博机/上下文老虎机

Facebook AI Research

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

2025生成式AI企业应用实务报告

【普林斯顿博士论文】移动计算摄影中的神经场表示

【ICML2025】SADA：稳定性引导的自适应扩散加速

LLMOps：大语言模型的生产环境管理

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

相关论文

The Adversary Bound Revisited: From Optimal Query Algorithms to Optimal Control

Arxiv

0+阅读 · 2022年11月29日

On Learning Fairness and Accuracy on Multiple Subgroups

Arxiv

0+阅读 · 2022年11月29日

Quasi-stable Coloring for Graph Compression: Approximating Max-Flow, Linear Programs, and Centrality

Arxiv

0+阅读 · 2022年11月29日

Personalized Reward Learning with Interaction-Grounded Learning (IGL)

Arxiv

0+阅读 · 2022年11月28日

Minimax AUC Fairness: Efficient Algorithm with Provable Convergence

Arxiv

0+阅读 · 2022年11月28日

An FMM Accelerated Poisson Solver for Complicated Geometries in the Plane using Function Extension

Arxiv

0+阅读 · 2022年11月26日

On the Re-Solving Heuristic for (Binary) Contextual Bandits with Knapsacks

Arxiv

0+阅读 · 2022年11月25日

Bayesian Learning for Neural Networks: an algorithmic survey

Bayesian Learning for Neural Networks: an algorithmic survey

Arxiv

0+阅读 · 2022年11月24日

FairFed: Enabling Group Fairness in Federated Learning

Arxiv

0+阅读 · 2022年11月23日

Learning with Differentiable Algorithms

Arxiv

11+阅读 · 2022年9月1日

相关基金

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

纳颗粒-纳结构复合SERS基底的生化辅助可控制造及性能增强方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Orexin/OX1R激动FOXO1/Atg7干预胰岛β细胞自噬的机制及其在胰岛功能缺陷中的意义

国家自然科学基金

0+阅读 · 2014年12月31日

基于PCA与二代Curvelet变换的多模态医学图像融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

炎症相关miRNA 及其靶序列SNPs 与胃癌发生的分子流行病学研究

国家自然科学基金

0+阅读 · 2012年12月31日

寻找多氯联苯代谢途径中缺失的一环

国家自然科学基金

0+阅读 · 2009年12月31日

MiRNA基因遗传变异与膀胱癌易感性及其分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

木质素酶-半纤维素酶多酶体系优化构建及其与产纤维素酶真菌协同发酵木质纤维素

国家自然科学基金

0+阅读 · 2009年12月31日

口腔白斑癌变的miRNA表达模型

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员