On the Finite-Time Performance of the Knowledge Gradient Algorithm - 专知 (Zhuanzhi) Papers

June 14, 2022

On the Finite-Time Performance of the Knowledge Gradient Algorithm

Yanwen Li, Siyang Gao

The knowledge gradient (KG) algorithm is a popular and effective algorithm for the best arm identification (BAI) problem. Because the KG calculation is complex, theoretical analysis of the algorithm is difficult, and existing results mostly concern its asymptotic performance, e.g., consistency and asymptotic sample allocation. In this research, we present new theoretical results on the finite-time performance of the KG algorithm. Under independent and normally distributed rewards, we derive lower and upper bounds for the probability of error and the simple regret of the algorithm. With these bounds, existing asymptotic results become simple corollaries. We also show the performance of the algorithm for the multi-armed bandit (MAB) problem. These developments not only extend the existing analysis of the KG algorithm but can also be used to analyze other improvement-based algorithms. Finally, we use numerical experiments to further demonstrate the finite-time behavior of the KG algorithm.
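
The abstract does not spell out the KG sampling rule. The following is a minimal sketch of how such a rule is commonly written for independent normal beliefs, assuming a known noise variance shared by all arms and a conjugate normal prior. The function names (`kg_factors`, `run_kg`), the prior, and all parameter values are illustrative assumptions and are not taken from the paper. At each step the sketch samples the arm with the largest KG factor, then reports the recommended arm and its simple regret once the budget is spent.

```python
import math
import random


def kg_factors(mu, var, noise_var):
    """KG factor of each arm under independent normal beliefs (illustrative)."""
    factors = []
    for i in range(len(mu)):
        # Std. dev. of the change in arm i's posterior mean after one more sample.
        sigma_tilde = var[i] / math.sqrt(var[i] + noise_var)
        best_other = max(m for j, m in enumerate(mu) if j != i)
        z = -abs(mu[i] - best_other) / sigma_tilde
        pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
        cdf = 0.5 * math.erfc(-z / math.sqrt(2.0))  # standard normal CDF
        # Expected one-step improvement in the best posterior mean.
        factors.append(sigma_tilde * (z * cdf + pdf))
    return factors


def run_kg(true_means, noise_var=1.0, budget=200, prior_var=100.0, seed=0):
    """Run the sketch on simulated normal rewards; needs at least two arms."""
    rng = random.Random(seed)
    k = len(true_means)
    mu = [0.0] * k           # assumed prior means
    var = [prior_var] * k    # assumed prior variances
    counts = [0] * k
    for _ in range(budget):
        factors = kg_factors(mu, var, noise_var)
        arm = max(range(k), key=factors.__getitem__)
        y = rng.gauss(true_means[arm], math.sqrt(noise_var))
        # Conjugate normal update of the sampled arm's belief.
        post_prec = 1.0 / var[arm] + 1.0 / noise_var
        mu[arm] = (mu[arm] / var[arm] + y / noise_var) / post_prec
        var[arm] = 1.0 / post_prec
        counts[arm] += 1
    recommended = max(range(k), key=mu.__getitem__)
    simple_regret = max(true_means) - true_means[recommended]
    return recommended, simple_regret, counts


if __name__ == "__main__":
    print(run_kg([0.0, 0.5, 3.0]))
```

Repeating `run_kg` over many seeds and recording how often `recommended` differs from the true best arm gives an empirical estimate of the probability of error at a given budget, which is the kind of finite-time quantity the abstract's bounds address.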
