实现线形强盗中最优化最优化武器识别 (Towards Minimax Optimal Best Arm Identification in Linear Bandits) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 优化器 · 线性的 · ARM · Performer ·

2021 年 5 月 27 日

Towards Minimax Optimal Best Arm Identification in Linear Bandits

翻译：实现线形强盗中最优化最优化武器识别

Junwen Yang,Vincent Y. F. Tan

from arxiv, 20 pages, 4 figures

We study the problem of best arm identification in linear bandits in the fixed-budget setting. By leveraging properties of the G-optimal design and incorporating it into the arm allocation rule, we design a parameter-free algorithm, Optimal Design-based Linear Best Arm Identification (OD-LinBAI). We provide a theoretical analysis of the failure probability of OD-LinBAI. While the performances of existing methods (e.g., BayesGap) depend on all the optimality gaps, OD-LinBAI depends on the gaps of the top $d$ arms, where $d$ is the effective dimension of the linear bandit instance. Furthermore, we present a minimax lower bound for this problem. The upper and lower bounds show that OD-LinBAI is minimax optimal up to multiplicative factors in the exponent. Finally, numerical experiments corroborate our theoretical findings.

翻译：我们研究了在固定预算环境中线性土匪中最佳武器识别问题。我们利用G-最佳设计特性并将其纳入武器分配规则,设计了一个无参数算法,即基于最佳设计的最佳线性武器识别(OD-LinBAI)。我们对OD-LinBAI的失败概率进行了理论分析。虽然现有方法(例如BayesGap)的性能取决于所有最佳性差,但OD-LinBAI取决于顶端的美元武器的差距,而美元是线性土匪实例的有效维度。此外,我们为这一问题提出了一条小号,小号为这一问题设定了下界。上下界显示OD-LinBAI的微轴最优性能与引量的多倍性因素。最后,数字实验证实了我们的理论结论。

0

相关内容

赌博机/老虎机

赌博机/老虎机

ICML2021接受论文列表出炉！1184篇论文都在这了！

专知会员服务

92+阅读 · 2021年6月3日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

60+阅读 · 2020年11月21日

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

专知会员服务

34+阅读 · 2020年8月14日

最新！CCF-A类人工智能顶会WWW2020最佳论文出炉！OSU最佳论文，北邮斩获最佳学生论文！

最新！CCF-A类人工智能顶会WWW2020最佳论文出炉！OSU最佳论文，北邮斩获最佳学生论文！

专知会员服务

27+阅读 · 2020年4月25日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

【ICCV 2019】贝叶斯优化的1-Bit CNNs 《Bayesian Optimized 1-Bit CNNs》

【ICCV 2019】贝叶斯优化的1-Bit CNNs 《Bayesian Optimized 1-Bit CNNs》

专知会员服务

16+阅读 · 2019年11月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | DSAA 2019 诚邀稿件

CCF C类 | DSAA 2019 诚邀稿件

Call4Papers

6+阅读 · 2019年5月13日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【ICCV 2017论文集】计算机视觉顶级会议ICCV2017 Open Access Repository

【ICCV 2017论文集】计算机视觉顶级会议ICCV2017 Open Access Repository

专知

6+阅读 · 2017年10月14日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Towards a Decomposition-Optimal Algorithm for Counting and Sampling Arbitrary Motifs in Sublinear Time

Arxiv

0+阅读 · 2021年7月19日

Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning

Arxiv

0+阅读 · 2021年7月17日

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms

Arxiv

0+阅读 · 2021年7月17日

Lower Bound for Sculpture Garden Problem

Arxiv

0+阅读 · 2021年7月17日

Reinforcement Learning for Optimal Stationary Control of Linear Stochastic Systems

Arxiv

0+阅读 · 2021年7月16日

Linear Programming Bounds for Almost-Balanced Binary Codes

Arxiv

0+阅读 · 2021年7月16日

Optimal tests of the composite null hypothesis arising in mediation analysis

Arxiv

0+阅读 · 2021年7月15日

Towards a Dimension-Free Understanding of Adaptive Linear Control

Arxiv

0+阅读 · 2021年7月15日

Generalized Covariance Estimator

Arxiv

0+阅读 · 2021年7月14日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

ICML2021接受论文列表出炉！1184篇论文都在这了！

专知会员服务

92+阅读 · 2021年6月3日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

60+阅读 · 2020年11月21日

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

专知会员服务

34+阅读 · 2020年8月14日

最新！CCF-A类人工智能顶会WWW2020最佳论文出炉！OSU最佳论文，北邮斩获最佳学生论文！

最新！CCF-A类人工智能顶会WWW2020最佳论文出炉！OSU最佳论文，北邮斩获最佳学生论文！

专知会员服务

27+阅读 · 2020年4月25日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

【ICCV 2019】贝叶斯优化的1-Bit CNNs 《Bayesian Optimized 1-Bit CNNs》

【ICCV 2019】贝叶斯优化的1-Bit CNNs 《Bayesian Optimized 1-Bit CNNs》

专知会员服务

16+阅读 · 2019年11月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | DSAA 2019 诚邀稿件

CCF C类 | DSAA 2019 诚邀稿件

Call4Papers

6+阅读 · 2019年5月13日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【ICCV 2017论文集】计算机视觉顶级会议ICCV2017 Open Access Repository

【ICCV 2017论文集】计算机视觉顶级会议ICCV2017 Open Access Repository

专知

6+阅读 · 2017年10月14日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Towards a Decomposition-Optimal Algorithm for Counting and Sampling Arbitrary Motifs in Sublinear Time

Arxiv

0+阅读 · 2021年7月19日

Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning

Arxiv

0+阅读 · 2021年7月17日

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms

Arxiv

0+阅读 · 2021年7月17日

Lower Bound for Sculpture Garden Problem

Arxiv

0+阅读 · 2021年7月17日

Reinforcement Learning for Optimal Stationary Control of Linear Stochastic Systems

Arxiv

0+阅读 · 2021年7月16日

Linear Programming Bounds for Almost-Balanced Binary Codes

Arxiv

0+阅读 · 2021年7月16日

Optimal tests of the composite null hypothesis arising in mediation analysis

Arxiv

0+阅读 · 2021年7月15日

Towards a Dimension-Free Understanding of Adaptive Linear Control

Arxiv

0+阅读 · 2021年7月15日

Generalized Covariance Estimator

Arxiv

0+阅读 · 2021年7月14日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

微信扫码咨询专知VIP会员