Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness

Contextual bandits · Bandits · Constraints · Costs · Facebook AI Research

May 25, 2023

Evgenii Chzhen,Christophe Giraud,Zhen Li,Gilles Stoltz

We consider contextual bandit problems with knapsacks [CBwK], in which a scalar reward is obtained and vector-valued costs are suffered at each round. The learner aims to maximize the cumulative reward while ensuring that the cumulative costs stay below predetermined cost constraints. We assume that contexts come from a continuous set, that costs can be signed, and that the expected reward and cost functions, while unknown, may be uniformly estimated, a typical assumption in the literature. In this setting, total-cost constraints so far had to be at least of order $T^{3/4}$, where $T$ is the number of rounds, and were even typically assumed to depend linearly on $T$. We are, however, motivated to use CBwK to impose a fairness constraint of equalized average costs between groups: the budget associated with the corresponding cost constraints should be as close as possible to the natural deviations, of order $\sqrt{T}$. To that end, we introduce a dual strategy based on projected-gradient-descent updates that can handle total-cost constraints of order $\sqrt{T}$ up to poly-logarithmic terms. This strategy is more direct and simpler than existing strategies in the literature. It relies on a careful, adaptive tuning of the step size.
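
To make the idea of a dual strategy with projected-gradient-descent updates more concrete, here is a minimal Python sketch. It is not the authors' algorithm: it assumes a single scalar cost instead of vector-valued costs, a finite action set, contexts drawn uniformly from [0, 1], exact knowledge of the expected reward and cost functions (the abstract only assumes they can be uniformly estimated), and a fixed step size rather than the adaptive tuning the abstract mentions. The names `dual_pgd_cbwk`, `reward_fn`, `cost_fn`, and `step_size` are hypothetical and chosen for illustration only.

```python
import math
import random


def dual_pgd_cbwk(T, budget, reward_fn, cost_fn, n_actions, step_size, seed=0):
    """Illustrative dual projected-gradient loop for one cost constraint.

    Assumptions (not from the source): scalar cost, finite actions,
    known reward/cost functions, fixed step size.
    """
    rng = random.Random(seed)
    lam = 0.0                          # dual variable, kept nonnegative
    per_round_budget = budget / T      # average cost allowed per round
    total_reward = 0.0
    total_cost = 0.0
    for _ in range(T):
        x = rng.random()               # context in [0, 1]
        # Primal step: pick the action with the best cost-penalized reward.
        a = max(range(n_actions),
                key=lambda k: reward_fn(x, k) - lam * cost_fn(x, k))
        r, c = reward_fn(x, a), cost_fn(x, a)
        total_reward += r
        total_cost += c
        # Dual step: gradient step on lam, then projection onto [0, inf).
        lam = max(0.0, lam + step_size * (c - per_round_budget))
    return total_reward, total_cost, lam


# Toy instance (hypothetical): action 0 does nothing, action 1 earns x at unit cost.
def toy_reward(x, k):
    return x if k == 1 else 0.0


def toy_cost(x, k):
    return 1.0 if k == 1 else 0.0


T = 10_000
budget = math.sqrt(T)                  # budget of order sqrt(T)
gamma = 1.0 / math.sqrt(T)             # fixed step size, an arbitrary choice here
reward, cost, lam = dual_pgd_cbwk(T, budget, toy_reward, toy_cost, 2, gamma)
print(f"reward={reward:.1f} cost={cost:.1f} budget={budget:.1f} lambda={lam:.3f}")
```

Because the projection can only increase the dual variable relative to the unprojected step, the final value satisfies `lam >= step_size * (total_cost - budget)`, so the cumulative cost exceeds the budget by at most `lam / step_size`. This is one way to see why the choice of step size matters for budgets as small as order $\sqrt{T}$; the sketch makes no claim about how the paper's adaptive tuning is actually carried out.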
