Contextual Combinatorial Bandits with Probabilistically Triggered Arms (Contextual Combinatorial Bandits with Probabilistically Triggered Arms) - 专知论文

会员服务 ·

0

方差 · 情境 · 赌博机/老虎机 · 算法 · 概率 ·

2023 年 3 月 30 日

Contextual Combinatorial Bandits with Probabilistically Triggered Arms

翻译：Contextual Combinatorial Bandits with Probabilistically Triggered Arms

Xutong Liu,Jinhang Zuo,Siwei Wang,John C. S. Lui,Mohammad Hajiesmaili,Adam Wierman,Wei Chen

from arxiv, arXiv admin note: text overlap with arXiv:2208.14837

We study contextual combinatorial bandits with probabilistically triggered arms (C$^2$MAB-T) under a variety of smoothness conditions that capture a wide range of applications, such as contextual cascading bandits and contextual influence maximization bandits. Under the triggering probability modulated (TPM) condition, we devise the C$^2$-UCB-T algorithm and propose a novel analysis that achieves an $\tilde{O}(d\sqrt{KT})$ regret bound, removing a potentially exponentially large factor $O(1/p_{\min})$, where $d$ is the dimension of contexts, $p_{\min}$ is the minimum positive probability that any arm can be triggered, and batch-size $K$ is the maximum number of arms that can be triggered per round. Under the variance modulated (VM) or triggering probability and variance modulated (TPVM) conditions, we propose a new variance-adaptive algorithm VAC$^2$-UCB and derive a regret bound $\tilde{O}(d\sqrt{T})$, which is independent of the batch-size $K$. As a valuable by-product, we find our analysis technique and variance-adaptive algorithm can be applied to the CMAB-T and C$^2$MAB~setting, improving existing results there as well. We also include experiments that demonstrate the improved performance of our algorithms compared with benchmark algorithms on synthetic and real-world datasets.

翻译：使用概率触发臂的情境组合赌博算法的研究（C$^2$MAB-T），其考虑多种平滑条件，其涵盖广泛的应用，例如情境级联赌博和情境影响最大化赌博。在触发概率调制（TPM）条件下，我们设计了C$^2$-UCB-T算法，并提出了一种新颖的分析方法，实现了一个 $\tilde{O}(d\sqrt{KT})$后悔上限，消除了一个潜在指数级的大因子$O(1/p_{min})$，其中$d$为情景的维数，$p_{min}$是任何臂可以被触发的最小概率，批处理大小$K$是每轮最多可以触发的臂的数量。在方差调节（VM）或触发概率和方差调节（TPVM）条件下，我们提出了一种新的方差自适应算法VAC$^2$-UCB，并得出了一个 $\tilde{O}(d\sqrt{T})$的后悔上限，该上限与批处理大小$K$无关。作为一个有价值的副产品，我们发现我们的分析技术和方差自适应算法也可以应用于CMAB-T和C$^2$MAB算法中，改善现有结果。我们还将我们的算法与基准算法在合成和实际数据集上的性能进行了比较实验，证明了我们算法比基准算法性能更优秀。

0

相关内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

【MIT出版社新书】提升概率推理导论，455页pdf，An Introduction to Lifted Probabilistic Inference

【MIT出版社新书】提升概率推理导论，455页pdf，An Introduction to Lifted Probabilistic Inference

专知会员服务

38+阅读 · 2022年2月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

专知会员服务

20+阅读 · 2020年6月23日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

已删除

德先生

53+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

微纳结构和新颖超材料中的非对称光学传输

国家自然科学基金

0+阅读 · 2015年12月31日

基于最大相关熵准则的支持向量机模型与算法研究

国家自然科学基金

3+阅读 · 2015年12月31日

Ru催化双导向基团参与C-H键活化及官能化反应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

组蛋白乙酰化修饰在12-脂氧化酶影响糖尿病性肾小球肥大中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

小样本空间制图

国家自然科学基金

0+阅读 · 2012年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Smad3调控前列腺癌进展的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于非参数层次贝叶斯模型的自适应字典稀疏表示方法及应用

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

Learning Global-aware Kernel for Image Harmonization

Arxiv

0+阅读 · 2023年5月19日

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Arxiv

0+阅读 · 2023年5月19日

What part of a numerical problem is ill-conditioned?

Arxiv

0+阅读 · 2023年5月19日

Efficient quantum linear solver algorithm with detailed running costs

Arxiv

0+阅读 · 2023年5月19日

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Arxiv

0+阅读 · 2023年5月18日

Worst-Case VCG Redistribution Mechanism Design Based on the Lottery Ticket Hypothesis

Arxiv

0+阅读 · 2023年5月18日

Two-step Newton's method for deflation-one singular zeros of analytic systems

Arxiv

0+阅读 · 2023年5月18日

Reinforcement Learning with History-Dependent Dynamic Contexts

Arxiv

0+阅读 · 2023年5月18日

Linear Query Approximation Algorithms for Non-monotone Submodular Maximization under Knapsack Constraint

Arxiv

0+阅读 · 2023年5月17日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

【MIT出版社新书】提升概率推理导论，455页pdf，An Introduction to Lifted Probabilistic Inference

【MIT出版社新书】提升概率推理导论，455页pdf，An Introduction to Lifted Probabilistic Inference

专知会员服务

38+阅读 · 2022年2月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

专知会员服务

20+阅读 · 2020年6月23日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

已删除

德先生

53+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Learning Global-aware Kernel for Image Harmonization

Arxiv

0+阅读 · 2023年5月19日

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Arxiv

0+阅读 · 2023年5月19日

What part of a numerical problem is ill-conditioned?

Arxiv

0+阅读 · 2023年5月19日

Efficient quantum linear solver algorithm with detailed running costs

Arxiv

0+阅读 · 2023年5月19日

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Arxiv

0+阅读 · 2023年5月18日

Worst-Case VCG Redistribution Mechanism Design Based on the Lottery Ticket Hypothesis

Arxiv

0+阅读 · 2023年5月18日

Two-step Newton's method for deflation-one singular zeros of analytic systems

Arxiv

0+阅读 · 2023年5月18日

Reinforcement Learning with History-Dependent Dynamic Contexts

Arxiv

0+阅读 · 2023年5月18日

Linear Query Approximation Algorithms for Non-monotone Submodular Maximization under Knapsack Constraint

Arxiv

0+阅读 · 2023年5月17日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

相关基金

微纳结构和新颖超材料中的非对称光学传输

国家自然科学基金

0+阅读 · 2015年12月31日

基于最大相关熵准则的支持向量机模型与算法研究

国家自然科学基金

3+阅读 · 2015年12月31日

Ru催化双导向基团参与C-H键活化及官能化反应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

组蛋白乙酰化修饰在12-脂氧化酶影响糖尿病性肾小球肥大中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

小样本空间制图

国家自然科学基金

0+阅读 · 2012年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Smad3调控前列腺癌进展的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于非参数层次贝叶斯模型的自适应字典稀疏表示方法及应用

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员