We consider the fixed-budget best arm identification problem in multi-armed bandits. A central goal in this field is to derive a tight lower bound on the probability of misidentifying the best arm and to develop a strategy whose performance guarantee matches that lower bound. However, this has long remained an open problem when the optimal allocation ratio of arm draws is unknown. In this paper, we answer this question in the regime where the gap between the expected rewards of the arms is small. First, we derive a tight problem-dependent lower bound, which characterizes the optimal allocation ratio; this ratio depends on the gap between the expected rewards and on the Fisher information of the bandit model. Then, we propose the "RS-AIPW" strategy, which combines a randomized sampling (RS) rule based on the estimated optimal allocation ratio with a recommendation rule based on the augmented inverse probability weighting (AIPW) estimator. The proposed strategy is optimal in the sense that its performance guarantee matches the derived lower bound under a small gap. In the course of the analysis, we also establish a novel large deviation bound for martingales.
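To make the recommendation rule concrete, the following is a minimal sketch of an AIPW mean estimate for a two-armed Gaussian bandit. All specifics (the arm means, the sampling ratio, the horizon, and the sample-splitting used for the plug-in estimates) are illustrative assumptions, not the paper's exact construction; in particular, the RS-AIPW strategy estimates the optimal allocation ratio and updates the plug-in estimates adaptively from past observations.

```python
import numpy as np

# Hypothetical two-armed Gaussian bandit with unit variance.
rng = np.random.default_rng(0)
T = 10_000
true_means = np.array([0.2, 0.8])   # assumed arm means (unknown to the strategy)
probs = np.array([0.4, 0.6])        # assumed sampling (allocation) ratio

arms = rng.choice(2, size=T, p=probs)          # randomized sampling rule
rewards = rng.normal(true_means[arms], 1.0)    # observed rewards

# Plug-in mean estimates from the first half of the data (the actual
# strategy updates these online so the AIPW terms form a martingale).
half = T // 2
m_hat = np.array([rewards[:half][arms[:half] == a].mean() for a in range(2)])

# AIPW estimate of each arm's mean over the second half:
#   (1/n) * sum_t [ 1{A_t = a} / p(a) * (Y_t - m_hat[a]) + m_hat[a] ]
a2, r2 = arms[half:], rewards[half:]
aipw = np.array([
    np.mean((a2 == a) / probs[a] * (r2 - m_hat[a]) + m_hat[a])
    for a in range(2)
])

# Recommendation rule: report the arm with the largest AIPW mean estimate.
best_arm = int(np.argmax(aipw))
```

The inverse-probability term corrects for the unequal draw frequencies induced by the sampling ratio, while the plug-in term reduces variance; this is the standard doubly robust construction that underlies the AIPW estimator.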