固定预算最佳武器鉴定全球最佳最佳标准 (Globally Optimal Algorithms for Fixed-Budged Best Arm Identification) - 专知论文

会员服务 ·

0

优化器 · 全局优化 · ARM · DOT · Neural Networks ·

2022 年 6 月 9 日

Globally Optimal Algorithms for Fixed-Budged Best Arm Identification

翻译：固定预算最佳武器鉴定全球最佳最佳标准

Junpei Komiyama,Taira Tsuchiya,Junya Honda

We consider the fixed-budget best arm identification problem where the goal is to find the arm of the largest mean with a fixed number of samples. It is known that the probability of misidentifying the best arm is exponentially small to the number of rounds. However, limited characterizations have been discussed on the rate (exponent) of this value. In this paper, we characterize the optimal rate as a result of global optimization over all possible parameters. We introduce two rates, $R^{\mathrm{go}}$ and $R^{\mathrm{go}}_{\infty}$, corresponding to lower bounds on the misidentification probability, each of which is associated with a proposed algorithm. The rate $R^{\mathrm{go}}$ is associated with $R^{\mathrm{go}}$-tracking, which can be efficiently implemented by a neural network and is shown to outperform existing algorithms. However, this rate requires a nontrivial condition to be achievable. To deal with this issue, we introduce the second rate $R^{\mathrm{go}}_\infty$. We show that this rate is indeed achievable by introducing a conceptual algorithm called delayed optimal tracking (DOT).

翻译：我们考虑的是固定预算最佳手臂识别问题,目标是找到最大平均值的手臂,并有固定数量的样本。已知误认最佳手臂的可能性极小,与弹道数量相比是成倍的。然而,对这一数值的速率(用量)进行了有限的定性讨论。在本文中,我们根据全球优化对所有可能的参数的最佳速率进行了定性。我们引入了两种汇率,即美元和美元,相当于误辨概率的下限,每种误辨概率都与提议的算法有关。美元-马特尔姆{戈涅因菲特元与美元-跟踪有关,可以通过神经网络高效地实施,并显示超过现有的算法。然而,这一比率需要一种非边际条件才能实现。为了解决这个问题,我们引入了第二种汇率美元-马特尔姆{戈涅因菲特$。我们表明,采用一种称为“最佳跟踪”的概念性算法确实可以实现这一速率。

0

相关内容

优化器

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

由金属有机框架和聚合物电解质制备复合质子传导膜及其结构和性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Ti-Ag 纳米管/HA 复合涂层的抗菌性能及生物相容性的基础研究

国家自然科学基金

0+阅读 · 2015年12月31日

介孔金属氮化物在燃料电池中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

多环（杂）芳烃桥联双金属化合物的合成及其性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TREM-1/DAP12/ NF-κB信号通路在6-姜烯酚抗动脉粥样硬化中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

具有抗肿瘤活性天然产物Marmycin A的全合成、衍生化及生物活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MYC2互作蛋白MFC1调控茉莉酸响应基因转录表达的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

Nac-1对c-Myc和Klf4基因的转录调控及其调节ES细胞多能性的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

藤黄属植物抗肿瘤活性成分及其作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

激光陀螺捷联惯导复合动态环境适应性

国家自然科学基金

0+阅读 · 2009年12月31日

New Optimal Periodic Control Policy for the Optimal Periodic Performance of a Chemostat Using a Fourier-Gegenbauer-Based Predictor-Corrector Method

Arxiv

0+阅读 · 2022年7月25日

On the convergence and sampling of randomized primal-dual algorithms and their application to parallel MRI reconstruction

Arxiv

0+阅读 · 2022年7月25日

Optimal Convergence Rates of Deep Neural Networks in a Classification Setting

Arxiv

0+阅读 · 2022年7月25日

Fast convergence rates for dose-response estimation

Arxiv

0+阅读 · 2022年7月24日

A Continuous-Time Perspective on Optimal Methods for Monotone Equation Problems

Arxiv

0+阅读 · 2022年7月24日

An Answer to the Bose-Nelson Sorting Problem for 11 and 12 Channels

Arxiv

0+阅读 · 2022年7月24日

Non-asymptotic near optimal algorithms for two sided matchings

Arxiv

0+阅读 · 2022年7月24日

Exact Matrix Factorization Updates for Nonlinear Programming

Exact Matrix Factorization Updates for Nonlinear Programming

Arxiv

0+阅读 · 2022年7月22日

Optimal precision for GANs

Optimal precision for GANs

Arxiv

0+阅读 · 2022年7月21日

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Arxiv

16+阅读 · 2020年3月12日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

New Optimal Periodic Control Policy for the Optimal Periodic Performance of a Chemostat Using a Fourier-Gegenbauer-Based Predictor-Corrector Method

Arxiv

0+阅读 · 2022年7月25日

On the convergence and sampling of randomized primal-dual algorithms and their application to parallel MRI reconstruction

Arxiv

0+阅读 · 2022年7月25日

Optimal Convergence Rates of Deep Neural Networks in a Classification Setting

Arxiv

0+阅读 · 2022年7月25日

Fast convergence rates for dose-response estimation

Arxiv

0+阅读 · 2022年7月24日

A Continuous-Time Perspective on Optimal Methods for Monotone Equation Problems

Arxiv

0+阅读 · 2022年7月24日

An Answer to the Bose-Nelson Sorting Problem for 11 and 12 Channels

Arxiv

0+阅读 · 2022年7月24日

Non-asymptotic near optimal algorithms for two sided matchings

Arxiv

0+阅读 · 2022年7月24日

Exact Matrix Factorization Updates for Nonlinear Programming

Exact Matrix Factorization Updates for Nonlinear Programming

Arxiv

0+阅读 · 2022年7月22日

Optimal precision for GANs

Optimal precision for GANs

Arxiv

0+阅读 · 2022年7月21日

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Arxiv

16+阅读 · 2020年3月12日

相关基金

由金属有机框架和聚合物电解质制备复合质子传导膜及其结构和性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Ti-Ag 纳米管/HA 复合涂层的抗菌性能及生物相容性的基础研究

国家自然科学基金

0+阅读 · 2015年12月31日

介孔金属氮化物在燃料电池中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

多环（杂）芳烃桥联双金属化合物的合成及其性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TREM-1/DAP12/ NF-κB信号通路在6-姜烯酚抗动脉粥样硬化中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

具有抗肿瘤活性天然产物Marmycin A的全合成、衍生化及生物活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MYC2互作蛋白MFC1调控茉莉酸响应基因转录表达的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

Nac-1对c-Myc和Klf4基因的转录调控及其调节ES细胞多能性的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

藤黄属植物抗肿瘤活性成分及其作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

激光陀螺捷联惯导复合动态环境适应性

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员