Dealing with Unknown Variances in Best-Arm Identification

January 23, 2023

Marc Jourdan,Rémy Degenne,Emilie Kaufmann

From arXiv: 73 pages, 5 figures, 3 tables. To be published in the 34th International Conference on Algorithmic Learning Theory, Singapore, 2023.

The problem of identifying the best arm among a collection of items with Gaussian reward distributions is well understood when the variances are known. Despite its practical relevance for many applications, few works have studied it for unknown variances. In this paper we introduce and analyze two approaches to deal with unknown variances, either by plugging in the empirical variance or by adapting the transportation costs. In order to calibrate our two stopping rules, we derive new time-uniform concentration inequalities, which are of independent interest. Then, we illustrate the theoretical and empirical performance of our two sampling rule wrappers on Track-and-Stop and on a Top Two algorithm. Moreover, by quantifying the impact on the sample complexity of not knowing the variances, we reveal that it is rather small.
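To make the two approaches in the abstract more concrete, here is a minimal Python sketch of the pairwise statistics a stopping rule of this kind might compare against a threshold. It is an illustration under stated assumptions, not the authors' implementation. The function names (`plug_in_cost`, `adapted_cost`, `should_stop`) are hypothetical, the formulas are the standard Gaussian forms rather than anything quoted from this page, and the threshold is a plain number supplied by the caller, since the calibrated thresholds derived in the paper are not reproduced here.

```python
import math


def plug_in_cost(mu_i, var_i, n_i, mu_j, var_j, n_j):
    """Known-variance Gaussian cost with empirical variances plugged in.

    Assumed form: (mu_i - mu_j)^2 / (2 * (var_i / n_i + var_j / n_j)).
    """
    if mu_i <= mu_j:
        return 0.0
    return (mu_i - mu_j) ** 2 / (2.0 * (var_i / n_i + var_j / n_j))


def adapted_cost(mu_i, var_i, n_i, mu_j, var_j, n_j, grid=1000):
    """Cost adapted to unknown variances (assumed form).

    Minimizes over a common mean u the sum of
    (n_a / 2) * log(1 + (mu_a - u)^2 / var_a) for the two arms.
    A plain grid search on [mu_j, mu_i] keeps the sketch simple.
    """
    if mu_i <= mu_j:
        return 0.0
    best = math.inf
    for k in range(grid + 1):
        u = mu_j + (mu_i - mu_j) * k / grid
        cost = (0.5 * n_i * math.log1p((mu_i - u) ** 2 / var_i)
                + 0.5 * n_j * math.log1p((mu_j - u) ** 2 / var_j))
        best = min(best, cost)
    return best


def should_stop(means, variances, counts, threshold, cost=plug_in_cost):
    """Stop when the empirical best arm beats every other arm by more than
    the threshold; returns (stop_flag, index_of_empirical_best_arm)."""
    best = max(range(len(means)), key=means.__getitem__)
    stat = min(
        cost(means[best], variances[best], counts[best],
             means[a], variances[a], counts[a])
        for a in range(len(means)) if a != best
    )
    return stat > threshold, best
```

Because log(1 + x) <= x, the adapted cost in this sketch never exceeds the plug-in cost on the same data, which is one way to see why the two stopping rules need different calibration.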
