We consider the stochastic linear contextual bandit problem with high-dimensional features. We analyze the Thompson sampling algorithm using special classes of sparsity-inducing priors (e.g., spike-and-slab) to model the unknown parameter, and we provide a nearly optimal upper bound on the expected cumulative regret. To the best of our knowledge, this is the first work that provides theoretical guarantees for Thompson sampling in high-dimensional and sparse contextual bandits. For faster computation, we use variational inference instead of Markov chain Monte Carlo (MCMC) to approximate the posterior distribution. Extensive simulations demonstrate the improved performance of our proposed algorithm over existing ones.
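To make the setting concrete, the following is a minimal sketch of Thompson sampling in a linear contextual bandit. It uses an exact Gaussian (ridge-style) posterior purely as a computational stand-in for the spike-and-slab posterior and variational approximation analyzed in the paper; the dimensions, sparsity level, noise variance, and prior precision below are all illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)

d, K, T = 20, 5, 500                 # feature dim, arms per round, horizon (assumed)
s = 3                                # sparsity level (assumed)
theta = np.zeros(d)
theta[:s] = 1.0                      # sparse true parameter (hypothetical)

lam, sigma2 = 1.0, 0.25              # prior precision, noise variance (assumed)
B = lam * np.eye(d)                  # posterior precision matrix
f = np.zeros(d)                      # accumulated reward-weighted contexts

regret = 0.0
for t in range(T):
    # Observe K candidate contexts for this round.
    X = rng.normal(size=(K, d)) / np.sqrt(d)

    # Posterior over theta is N(B^{-1} f, B^{-1}); draw one Thompson sample.
    mu = np.linalg.solve(B, f)
    cov = np.linalg.inv(B)
    theta_tilde = rng.multivariate_normal(mu, cov)

    # Play the arm that is optimal under the sampled parameter.
    a = int(np.argmax(X @ theta_tilde))
    r = X[a] @ theta + rng.normal(scale=np.sqrt(sigma2))

    # Conjugate Bayesian linear-regression update.
    B += np.outer(X[a], X[a]) / sigma2
    f += X[a] * r / sigma2

    # Track instantaneous regret against the true best arm.
    regret += np.max(X @ theta) - X[a] @ theta

print(f"cumulative regret after {T} rounds: {regret:.2f}")
```

Replacing the Gaussian posterior with a spike-and-slab prior makes the posterior intractable, which is why the paper resorts to variational inference rather than MCMC for the posterior approximation.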