存储强盗中最佳武器识别:超过$-美元 (Best Arm Identification in Stochastic Bandits: Beyond $β-$optimality) - 专知论文

会员服务 ·

0

优化器 · ARM · Bandits · 样本 · 情景 ·

2023 年 1 月 10 日

Best Arm Identification in Stochastic Bandits: Beyond $β-$optimality

翻译：存储强盗中最佳武器识别:超过$-美元

Arpan Mukherjee,Ali Tajer

This paper focuses on best arm identification (BAI) in stochastic multi-armed bandits (MABs) in the fixed-confidence, parametric setting. In such pure exploration problems, the accuracy of the sampling strategy critically hinges on the sequential allocation of the sampling resources among the arms. The existing approaches to BAI address the following question: what is an optimal sampling strategy when we spend a $\beta$ fraction of the samples on the best arm? These approaches treat $\beta$ as a tunable parameter and offer efficient algorithms that ensure optimality up to selecting $\beta$, hence $\beta-$optimality. However, the BAI decisions and performance can be highly sensitive to the choice of $\beta$. This paper provides a BAI algorithm that is agnostic to $\beta$, dispensing with the need for tuning $\beta$, and specifies an optimal allocation strategy, including the optimal value of $\beta$. Furthermore, the existing relevant literature focuses on the family of exponential distributions. This paper considers a more general setting of any arbitrary family of distributions parameterized by their mean values (under mild regularity conditions).

翻译：本文侧重于固定信心和参数设置中精密多武装强盗(MABs)中最好的手臂识别(BAI),在这种纯粹的勘探问题中,抽样战略的准确性关键取决于武器之间抽样资源的顺序分配。BAI的现有办法解决了下列问题:当我们在最好的手臂上花费样品的一分钱一分钱时,最佳采样战略是什么?这些办法把$\Beta美元作为金枪鱼的参数,并提供有效的算法,确保最佳性地选择$\beta美元,也就是$\beta-obatity。然而,BAI的决定和性能对$\beta美元的选择可能非常敏感。本文提供了一种BAI算法的算法,该算法对美元和Beta美元具有敏感性,解决了调整$\beta美元的最佳分配战略,包括美元/beta美元的最佳价值。此外,现有的有关文献侧重于指数分布的家庭。本文认为,任何任意的分布范围都比较一般地按其平均值定出的分布参数。

0

相关内容

优化器

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

大豆GmMYB1应答植物盐胁迫的表观遗传调控及其作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

IGF1R-RACK1-STAT3通路在苦蘵内酯P抗EGFR T790M突变非小细胞肺癌中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

三种吴茱萸属植物中新型吲哚喹唑啉生物碱的发现及其抗真菌活性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Actinophyllic Acid类含七元环的复杂多环活性天然产物全合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

Cr3+:ABSi2O6(A=Na，K，Ca；B=Mg，Al)可调谐激光晶体的研制

国家自然科学基金

0+阅读 · 2014年12月31日

机械式自动变速器的滚动优化控制

国家自然科学基金

0+阅读 · 2012年12月31日

药用植物内生真菌抗菌活性次生代谢产物研究

国家自然科学基金

0+阅读 · 2012年12月31日

以EGFR为识别靶位多靶点联合克服NSCLC EGFR TKIs耐药的基因干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

三维网络结构在微通道内表面上的构筑及其在痕量蛋白质富集分离上的应用

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

Approximate degree lower bounds for oracle identification problems

Arxiv

0+阅读 · 2023年3月7日

Multilevel Monte Carlo methods for stochastic convection-diffusion eigenvalue problems

Arxiv

0+阅读 · 2023年3月7日

A Unified Algebraic Perspective on Lipschitz Neural Networks

Arxiv

0+阅读 · 2023年3月6日

Numerical analysis of a nonsmooth quasilinear elliptic control problem: I. Explicit second-order optimality conditions

Arxiv

0+阅读 · 2023年3月6日

Low-discrepancy Sampling in the Expanded Dimensional Space: An Acceleration Technique for Particle Swarm Optimization

Arxiv

0+阅读 · 2023年3月6日

PRECISION: Decentralized Constrained Min-Max Learning with Low Communication and Sample Complexities

Arxiv

0+阅读 · 2023年3月5日

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

Arxiv

0+阅读 · 2023年3月4日

On Private and Robust Bandits

Arxiv

0+阅读 · 2023年3月4日

L-2 Regularized maximum likelihood for $β$-model in large and sparse networks

Arxiv

0+阅读 · 2023年3月4日

Locally Regularized Neural Differential Equations: Some Black Boxes were meant to remain closed!

Arxiv

0+阅读 · 2023年3月3日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《代码、指挥与冲突：描绘军事人工智能的未来》报告

【斯坦福博士论文】面向地理空间数据的多模态与多尺度建模：时空生成式人工智能

美国启动“自有军事人工智能计划”：采用谷歌Gemini以推动全军人工智能应用

《创新与适应性作为军事成功的关键因素：来自俄乌战争的战略洞见》报告

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

相关论文

Approximate degree lower bounds for oracle identification problems

Arxiv

0+阅读 · 2023年3月7日

Multilevel Monte Carlo methods for stochastic convection-diffusion eigenvalue problems

Arxiv

0+阅读 · 2023年3月7日

A Unified Algebraic Perspective on Lipschitz Neural Networks

Arxiv

0+阅读 · 2023年3月6日

Numerical analysis of a nonsmooth quasilinear elliptic control problem: I. Explicit second-order optimality conditions

Arxiv

0+阅读 · 2023年3月6日

Low-discrepancy Sampling in the Expanded Dimensional Space: An Acceleration Technique for Particle Swarm Optimization

Arxiv

0+阅读 · 2023年3月6日

PRECISION: Decentralized Constrained Min-Max Learning with Low Communication and Sample Complexities

Arxiv

0+阅读 · 2023年3月5日

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

Arxiv

0+阅读 · 2023年3月4日

On Private and Robust Bandits

Arxiv

0+阅读 · 2023年3月4日

L-2 Regularized maximum likelihood for $β$-model in large and sparse networks

Arxiv

0+阅读 · 2023年3月4日

Locally Regularized Neural Differential Equations: Some Black Boxes were meant to remain closed!

Arxiv

0+阅读 · 2023年3月3日

相关基金

大豆GmMYB1应答植物盐胁迫的表观遗传调控及其作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

IGF1R-RACK1-STAT3通路在苦蘵内酯P抗EGFR T790M突变非小细胞肺癌中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

三种吴茱萸属植物中新型吲哚喹唑啉生物碱的发现及其抗真菌活性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Actinophyllic Acid类含七元环的复杂多环活性天然产物全合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

Cr3+:ABSi2O6(A=Na，K，Ca；B=Mg，Al)可调谐激光晶体的研制

国家自然科学基金

0+阅读 · 2014年12月31日

机械式自动变速器的滚动优化控制

国家自然科学基金

0+阅读 · 2012年12月31日

药用植物内生真菌抗菌活性次生代谢产物研究

国家自然科学基金

0+阅读 · 2012年12月31日

以EGFR为识别靶位多靶点联合克服NSCLC EGFR TKIs耐药的基因干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

三维网络结构在微通道内表面上的构筑及其在痕量蛋白质富集分离上的应用

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员