上下文感知多臂老虎机：带装箱和覆盖约束的模块化Lagrangian方法 via 回归 (Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression) - 专知论文

会员服务 ·

0

上下文赌博机/上下文老虎机 · 赌博机/老虎机 · Packing · 约束 · FOCS ·

2023 年 3 月 20 日

Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression

翻译：上下文感知多臂老虎机：带装箱和覆盖约束的模块化Lagrangian方法 via 回归

Aleksandrs Slivkins,Karthik Abinav Sankararaman,Dylan Foster

We consider a variant of contextual bandits in which the algorithm consumes multiple resources subject to linear constraints on total consumption. This problem generalizes contextual bandits with knapsacks (CBwK), allowing for packing and covering constraints, as well as positive and negative resource consumption. We present a new algorithm that is simple, computationally efficient, and admits vanishing regret. It is statistically optimal for CBwK when an algorithm must stop once some constraint is violated. Our algorithm builds on LagrangeBwK (Immorlica et al., FOCS 2019) , a Lagrangian-based technique for CBwK, and SquareCB (Foster and Rakhlin, ICML 2020), a regression-based technique for contextual bandits. Our analysis leverages the inherent modularity of both techniques.

翻译：我们考虑上下文感知多臂老虎机算法的变体，其中算法消耗多个资源，受到总消耗的线性约束。该问题推广了带背包的上下文感知多臂老虎机（CBwK），允许装箱和覆盖约束，以及正和负资源消耗。我们提出了一种新算法，它简单、计算效率高，并且具有逐渐减小的遗憾。当一个算法必须在某个约束条件被违反时停止时，它在CBwK上是统计上最优的。我们的算法建立在LagrangeBwK（Immorlica等人，FOCS 2019）和SquareCB（Foster和Rakhlin，ICML 2020）之上，这是一种基于Lagrangian的技术和一种基于回归的技术。我们的分析利用了两种技术的固有模块性。

0

相关内容

上下文赌博机/上下文老虎机

上下文赌博机/上下文老虎机

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【硬核书】树与网络上的概率，716页pdf

【硬核书】树与网络上的概率，716页pdf

专知会员服务

77+阅读 · 2021年12月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

约束Lp正则化问题算法及应用

国家自然科学基金

0+阅读 · 2012年12月31日

光滑函数类上的几个逼近问题

国家自然科学基金

0+阅读 · 2012年12月31日

Takagi-Sugeno 模糊广义系统逼近原理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Sparse Positive-Definite Estimation for Large Covariance Matrices with Repeated Measurements

Arxiv

0+阅读 · 2023年5月11日

Domain Agnostic Image-to-image Translation using Low-Resolution Conditioning

Arxiv

0+阅读 · 2023年5月11日

Mixing a Covert and a Non-Covert User

Arxiv

0+阅读 · 2023年5月10日

ProxMaP: Proximal Occupancy Map Prediction for Efficient Indoor Robot Navigation

Arxiv

0+阅读 · 2023年5月10日

Convergence of a Normal Map-based Prox-SGD Method under the KL Inequality

Arxiv

0+阅读 · 2023年5月10日

VIP会员

文章信息

相关主题

上下文赌博机/上下文老虎机

赌博机/老虎机

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【硬核书】树与网络上的概率，716页pdf

【硬核书】树与网络上的概率，716页pdf

专知会员服务

77+阅读 · 2021年12月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

生成式人工智能导论：可靠性、负责任开发及实际应用（第二版）

《2025财年美陆军转型倡议（ATI）部队结构与组织提案》

【CMU博士论文】分布偏移下的可信机器学习

智能体 EDA 的曙光：自主数字芯片设计综述

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Sparse Positive-Definite Estimation for Large Covariance Matrices with Repeated Measurements

Arxiv

0+阅读 · 2023年5月11日

Domain Agnostic Image-to-image Translation using Low-Resolution Conditioning

Arxiv

0+阅读 · 2023年5月11日

Mixing a Covert and a Non-Covert User

Arxiv

0+阅读 · 2023年5月10日

ProxMaP: Proximal Occupancy Map Prediction for Efficient Indoor Robot Navigation

Arxiv

0+阅读 · 2023年5月10日

Convergence of a Normal Map-based Prox-SGD Method under the KL Inequality

Arxiv

0+阅读 · 2023年5月10日

相关基金

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

约束Lp正则化问题算法及应用

国家自然科学基金

0+阅读 · 2012年12月31日

光滑函数类上的几个逼近问题

国家自然科学基金

0+阅读 · 2012年12月31日

Takagi-Sugeno 模糊广义系统逼近原理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员