对基于非CBUB的顶顶上两高算法进行非症状分析 (Non-Asymptotic Analysis of a UCB-based Top Two Algorithm) - 专知论文

会员服务 ·

0

Analysis · Performance · 样本 · ARM · 赌博机/老虎机 ·

2023 年 1 月 25 日

Non-Asymptotic Analysis of a UCB-based Top Two Algorithm

翻译：对基于非CBUB的顶顶上两高算法进行非症状分析

Marc Jourdan,Rémy Degenne

from arxiv, 32 pages, 5 figures, 3 tables

A Top Two sampling rule for bandit identification is a method which selects the next arm to sample from among two candidate arms, a leader and a challenger. Due to their simplicity and good empirical performance, they have received increased attention in recent years. However, for fixed-confidence best arm identification, theoretical guarantees for Top Two methods have only been obtained in the asymptotic regime, when the error level vanishes. In this paper, we derive the first non-asymptotic upper bound on the expected sample complexity of a Top Two algorithm, which holds for any error level. Our analysis highlights sufficient properties for a regret minimization algorithm to be used as leader. These properties are satisfied by the UCB algorithm, and our proposed UCB-based Top Two algorithm simultaneously enjoys non-asymptotic guarantees and competitive empirical performance.

翻译：盗匪识别的顶端二号抽样规则是一种方法,它从两个候选武器中选择下一个手臂作为样本,一个是领导者,另一个是挑战者。由于它们简单和良好的经验性表现,近年来它们受到越来越多的关注。然而,对于固定自信的最佳手臂识别,只有当误差水平消失时,才在无药可救制度中获得对顶端二号方法的理论保障。在本文中,我们从一个顶端二号算法的预期样本复杂性中得出第一个非被动上层约束,该算法将维持在任何错误水平上。我们的分析强调了用于将遗憾最小化算法用作领导者的足够特性。这些特性为UCB算法所满足,而我们提议的基于UCB的顶端二号算法同时享有非痛苦保证和竞争性经验性表现。

0

相关内容

Analysis

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【经典书】算法设计与分析，727页pdf，Algorithms Design and Analysis，牛津大学出版社

【经典书】算法设计与分析，727页pdf，Algorithms Design and Analysis，牛津大学出版社

专知会员服务

137+阅读 · 2020年2月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

导热增强型复合相变材料的影响因素及传热机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于规范的企业竞争力演化的计算实验研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于Fermi-LAT和AMS-02的暗物质理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型FePt基纳米复合永磁材料的研究

国家自然科学基金

0+阅读 · 2013年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

离子印迹磁性纳米材料选择性净化重金属废水作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

多倍体西瓜果实番茄红素合成与代谢关键酶基因的表达分析

国家自然科学基金

0+阅读 · 2011年12月31日

ASPP2调节肝癌细胞上皮间质转化的研究

国家自然科学基金

0+阅读 · 2011年12月31日

车辆传动换挡过程非线性动力学建模与求解

国家自然科学基金

0+阅读 · 2009年12月31日

High-dimensional Censored Regression via the Penalized Tobit Likelihood

Arxiv

0+阅读 · 2023年3月17日

Liability regimes in the age of AI: a use-case driven analysis of the burden of proof

Arxiv

0+阅读 · 2023年3月17日

A Non-Asymptotic Framework for Approximate Message Passing in Spiked Models

Arxiv

0+阅读 · 2023年3月17日

CausalEGM: a general causal inference framework by encoding generative modeling

Arxiv

0+阅读 · 2023年3月16日

Hardness of the Generalized Coloring Numbers

Arxiv

0+阅读 · 2023年3月16日

Explaining Groups of Instances Counterfactually for XAI: A Use Case, Algorithm and User Study for Group-Counterfactuals

Arxiv

0+阅读 · 2023年3月16日

lmw: Linear Model Weights for Causal Inference

Arxiv

0+阅读 · 2023年3月15日

Reparameterization of extreme value framework for improved Bayesian workflow

Arxiv

0+阅读 · 2023年3月15日

A scaling-invariant algorithm for linear programming whose running time depends only on the constraint matrix

Arxiv

0+阅读 · 2023年3月15日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

66+阅读 · 2023年2月15日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【经典书】算法设计与分析，727页pdf，Algorithms Design and Analysis，牛津大学出版社

【经典书】算法设计与分析，727页pdf，Algorithms Design and Analysis，牛津大学出版社

专知会员服务

137+阅读 · 2020年2月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

High-dimensional Censored Regression via the Penalized Tobit Likelihood

Arxiv

0+阅读 · 2023年3月17日

Liability regimes in the age of AI: a use-case driven analysis of the burden of proof

Arxiv

0+阅读 · 2023年3月17日

A Non-Asymptotic Framework for Approximate Message Passing in Spiked Models

Arxiv

0+阅读 · 2023年3月17日

CausalEGM: a general causal inference framework by encoding generative modeling

Arxiv

0+阅读 · 2023年3月16日

Hardness of the Generalized Coloring Numbers

Arxiv

0+阅读 · 2023年3月16日

Explaining Groups of Instances Counterfactually for XAI: A Use Case, Algorithm and User Study for Group-Counterfactuals

Arxiv

0+阅读 · 2023年3月16日

lmw: Linear Model Weights for Causal Inference

Arxiv

0+阅读 · 2023年3月15日

Reparameterization of extreme value framework for improved Bayesian workflow

Arxiv

0+阅读 · 2023年3月15日

A scaling-invariant algorithm for linear programming whose running time depends only on the constraint matrix

Arxiv

0+阅读 · 2023年3月15日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

相关基金

导热增强型复合相变材料的影响因素及传热机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于规范的企业竞争力演化的计算实验研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于Fermi-LAT和AMS-02的暗物质理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型FePt基纳米复合永磁材料的研究

国家自然科学基金

0+阅读 · 2013年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

离子印迹磁性纳米材料选择性净化重金属废水作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

多倍体西瓜果实番茄红素合成与代谢关键酶基因的表达分析

国家自然科学基金

0+阅读 · 2011年12月31日

ASPP2调节肝癌细胞上皮间质转化的研究

国家自然科学基金

0+阅读 · 2011年12月31日

车辆传动换挡过程非线性动力学建模与求解

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员