Thompson 无限制拖延抽样 (Thompson Sampling with Unrestricted Delays) - 专知论文

会员服务 ·

0

样本 · Extensibility · 赌博机/老虎机 · 知识 (knowledge) · 情景 ·

2022 年 5 月 22 日

Thompson Sampling with Unrestricted Delays

翻译：Thompson 无限制拖延抽样

Han Wu,Stefan Wager

We investigate properties of Thompson Sampling in the stochastic multi-armed bandit problem with delayed feedback. In a setting with i.i.d delays, we establish to our knowledge the first regret bounds for Thompson Sampling with arbitrary delay distributions, including ones with unbounded expectation. Our bounds are qualitatively comparable to the best available bounds derived via ad-hoc algorithms, and only depend on delays via selected quantiles of the delay distributions. Furthermore, in extensive simulation experiments, we find that Thompson Sampling outperforms a number of alternative proposals, including methods specifically designed for settings with delayed feedback.

翻译：我们调查Thompson抽样调查在多武装盗匪问题中与拖延反馈问题有关的特点。在出现拖延的环境下,我们知道Thompson抽样的首个遗憾界限是任意拖延分发的,包括无限制预期的。我们的界限在质量上可以与通过特设算法得出的现有最佳界限相比,并且只取决于延迟分发的选定数的延误。此外,在广泛的模拟实验中,我们发现Thompson抽样比一些备选提案要好,包括专门为有延迟反馈的环境设计的方法。

0

相关内容

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

CSE1L在神经母细胞瘤发展中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

S100A13促甲状腺癌上皮间质化及侵袭转移的作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

Dirichlet空间的分析与几何

国家自然科学基金

0+阅读 · 2011年12月31日

SDQ: Stochastic Differentiable Quantization with Mixed Precision

Arxiv

0+阅读 · 2022年7月11日

Deep Active Learning for Regression Using $ε$-weighted Hybrid Query Strategy

Arxiv

0+阅读 · 2022年7月10日

Smooth Fictitious Play in Stochastic Games with Perturbed Payoffs and Unknown Transitions

Arxiv

0+阅读 · 2022年7月7日

On the instrumental variable estimation with many weak and invalid instruments

Arxiv

0+阅读 · 2022年7月7日

Efficient inverse $Z$-transform and pricing barrier and lookback options with discrete monitoring

Arxiv

0+阅读 · 2022年7月6日

VIP会员

文章信息

相关主题

赌博机/老虎机

知识 (knowledge)

相关VIP内容

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《基于AI的动态任务分配策略实现多智能体系统有意义人类控制》报告

《超越连接：AI驱动网络未来愿景》最新报告

人工智能赋能多域作战：能力与挑战

《战场空间决策优势：AI基础与应用研究》总结报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

相关论文

SDQ: Stochastic Differentiable Quantization with Mixed Precision

Arxiv

0+阅读 · 2022年7月11日

Deep Active Learning for Regression Using $ε$-weighted Hybrid Query Strategy

Arxiv

0+阅读 · 2022年7月10日

Smooth Fictitious Play in Stochastic Games with Perturbed Payoffs and Unknown Transitions

Arxiv

0+阅读 · 2022年7月7日

On the instrumental variable estimation with many weak and invalid instruments

Arxiv

0+阅读 · 2022年7月7日

Efficient inverse $Z$-transform and pricing barrier and lookback options with discrete monitoring

Arxiv

0+阅读 · 2022年7月6日

相关基金

CSE1L在神经母细胞瘤发展中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

S100A13促甲状腺癌上皮间质化及侵袭转移的作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

Dirichlet空间的分析与几何

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员