常见最佳武器鉴定中Bayes 最佳比标值的亚优性表现 (Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification) - 专知论文

会员服务 ·

0

频率主义学派 · SimPLe · Performer · 优化器 · ARM ·

2022 年 8 月 19 日

Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

翻译：常见最佳武器鉴定中Bayes 最佳比标值的亚优性表现

Junpei Komiyama

We consider the fixed-budget best-arm identification problem with Normal reward distributions. In this problem, the forecaster is given $K$ arms (or treatments) and $T$ time steps. The forecaster attempts to find the best arm, defined by the largest mean, via an adaptive experiment conducted using an algorithm. The algorithm's performance is measured by the simple regret, that is, the quality of the estimated best arm. The frequentist simple regret can be exponentially small to $T$, whereas the Bayesian simple regret is polynomially small to $T$. This paper demonstrates that Bayes optimal algorithm, which minimizes the Bayesian simple regret, does not produce an exponential simple regret for some parameters, a finding that contrasts with the many results indicating the asymptotic equivalence of Bayesian and frequentist algorithms in the context of fixed sampling regimes. While the Bayes optimal algorithm is described in terms of a recursive equation that is virtually impossible to compute exactly, we establish the foundations for further analysis by introducing a key quantity that we call the expected Bellman improvement.

翻译：我们用正常的奖励分配来考虑固定预算最佳武器识别问题。在这个问题中, 预报员得到的是KK$的军火( 或治疗) 和$T 的时间步骤。预报员试图通过使用算法进行的适应性实验找到用最大平均值定义的最好的手臂。算法的性能是通过简单的遗憾来测量的, 也就是说, 估计的最好的手臂的质量。经常者简单遗憾可以指数化地小到$T, 而巴耶斯简单的遗憾是单数小到$T。本文表明, 巴伊斯的最佳算法, 尽可能减少巴伊西亚人的简单遗憾, 并没有对某些参数产生指数化的简单遗憾, 这一结果与在固定采样制度中表明巴伊西亚人和经常性算法的无足轻重的等同性结果形成对比。虽然贝伊斯的最佳算法是用一种折叠式的公式描述的, 几乎无法准确计算, 我们为进一步分析打下基础, 提出一个我们称之为贝尔曼改进的关键数量。

0

相关内容

频率主义学派

频率主义学派

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

齿梗孢霉产aurovertin类化合物的生物合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

S3AGA样本（Spitzer-SDSS Spectral Atlas of Galaxies and AGNs)及其AGN研究

国家自然科学基金

0+阅读 · 2014年12月31日

Neolaxiflorin B的全合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

多环（杂）芳烃桥联双金属化合物的合成及其性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于正则Vine copula的相依建模及软件开发

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

过渡金属催化的二茂铁联烯化合物的偶联反应研究

国家自然科学基金

0+阅读 · 2009年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

铁电配合物的合成，结构与性质研究

国家自然科学基金

0+阅读 · 2008年12月31日

硅杂环戊二烯共轭低聚物的设计、合成及光电性能

国家自然科学基金

0+阅读 · 2008年12月31日

Inference on Causal Effects of Interventions in Time using Gaussian Processes

Arxiv

0+阅读 · 2022年10月6日

On the detrimental effect of invariances in the likelihood for variational inference

Arxiv

0+阅读 · 2022年10月6日

The Power of Duality: Response Time Analysis meets Integer Programming

Arxiv

0+阅读 · 2022年10月5日

A uniform kernel trick for high-dimensional two-sample problems

Arxiv

0+阅读 · 2022年10月5日

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

Arxiv

0+阅读 · 2022年10月4日

Dealing with Unknown Variances in Best-Arm Identification

Arxiv

0+阅读 · 2022年10月3日

Bayesian Inference using the Proximal Mapping: Uncertainty Quantification under Varying Dimensionality

Arxiv

0+阅读 · 2022年10月3日

On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits

Arxiv

0+阅读 · 2022年9月30日

Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve

Arxiv

0+阅读 · 2022年9月29日

A Quantitative Account of Harm

Arxiv

0+阅读 · 2022年9月29日

VIP会员

文章信息

相关主题

频率主义学派

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Inference on Causal Effects of Interventions in Time using Gaussian Processes

Arxiv

0+阅读 · 2022年10月6日

On the detrimental effect of invariances in the likelihood for variational inference

Arxiv

0+阅读 · 2022年10月6日

The Power of Duality: Response Time Analysis meets Integer Programming

Arxiv

0+阅读 · 2022年10月5日

A uniform kernel trick for high-dimensional two-sample problems

Arxiv

0+阅读 · 2022年10月5日

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

Arxiv

0+阅读 · 2022年10月4日

Dealing with Unknown Variances in Best-Arm Identification

Arxiv

0+阅读 · 2022年10月3日

Bayesian Inference using the Proximal Mapping: Uncertainty Quantification under Varying Dimensionality

Arxiv

0+阅读 · 2022年10月3日

On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits

Arxiv

0+阅读 · 2022年9月30日

Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve

Arxiv

0+阅读 · 2022年9月29日

A Quantitative Account of Harm

Arxiv

0+阅读 · 2022年9月29日

相关基金

齿梗孢霉产aurovertin类化合物的生物合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

S3AGA样本（Spitzer-SDSS Spectral Atlas of Galaxies and AGNs)及其AGN研究

国家自然科学基金

0+阅读 · 2014年12月31日

Neolaxiflorin B的全合成研究

国家自然科学基金

0+阅读 · 2014年12月31日

多环（杂）芳烃桥联双金属化合物的合成及其性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于正则Vine copula的相依建模及软件开发

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

过渡金属催化的二茂铁联烯化合物的偶联反应研究

国家自然科学基金

0+阅读 · 2009年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

铁电配合物的合成，结构与性质研究

国家自然科学基金

0+阅读 · 2008年12月31日

硅杂环戊二烯共轭低聚物的设计、合成及光电性能

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员