Bayesian 固定预算 (Bayesian Fixed-Budget Best-Arm Identification) - 专知论文

会员服务 ·

0

可辨认的 · 赌博机/老虎机 · 情景 · 优化器 · 频率主义学派 ·

2022 年 11 月 15 日

Bayesian Fixed-Budget Best-Arm Identification

翻译：Bayesian 固定预算

Alexia Atsidakou,Sumeet Katariya,Sujay Sanghavi,Branislav Kveton

Fixed-budget best-arm identification (BAI) is a bandit problem where the learning agent maximizes the probability of identifying the optimal arm after a fixed number of observations. In this work, we initiate the study of this problem in the Bayesian setting. We propose a Bayesian elimination algorithm and derive an upper bound on the probability that it fails to identify the optimal arm. The bound reflects the quality of the prior and is the first such bound in this setting. We prove it using a frequentist-like argument, where we carry the prior through, and then integrate out the random bandit instance at the end. Our upper bound asymptotically matches a newly established lower bound for $2$ arms. Our experimental results show that Bayesian elimination is superior to frequentist methods and competitive with the state-of-the-art Bayesian algorithms that have no guarantees in our setting.

翻译：固定预算最佳武器识别( BAI) 是一个土匪问题, 学习代理商在固定的观察次数后, 将确定最佳武器的最佳武器的可能性最大化。在这项工作中, 我们开始在巴伊西亚环境中研究这一问题。我们建议采用巴伊西亚消除算法, 并根据它未能确定最佳武器的可能性得出一个上限。捆绑反映了先前武器的质量, 并且是这个环境中第一个这样的约束。我们用一种经常式的比喻来证明它, 我们先从中走过, 然后在结尾处将随机的土匪比方整合出来。我们的上层正同时匹配一个新建的低价武器约束。我们的实验结果显示, 巴伊西亚消除方法优于常态方法, 并且与最先进的巴伊西亚人算法相比, 在我们的环境下没有保障。

0

相关内容

可辨认的

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

高速列车运行条件下轮对轴承的故障行为分析与表征方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

非线性脉冲随机系统的有限时间稳定、噪声镇定与不连续控制

国家自然科学基金

0+阅读 · 2013年12月31日

以石墨烯构建二维水通道正渗透膜与内浓差极化消除机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于正交投影迭代学习的高频响直线伺服系统重复性扰动辨识研究

国家自然科学基金

0+阅读 · 2013年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning

Arxiv

0+阅读 · 2023年1月13日

A fully Bayesian sparse polynomial chaos expansion approach with joint priors on the coefficients and global selection of terms

Arxiv

0+阅读 · 2023年1月13日

Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data

Arxiv

0+阅读 · 2023年1月13日

Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle

Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle

Arxiv

0+阅读 · 2023年1月12日

A Polynomial-time, Truthful, Individually Rational and Budget Balanced Ridesharing Mechanism

Arxiv

0+阅读 · 2023年1月11日

VIP会员

文章信息

相关主题

赌博机/老虎机

频率主义学派

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的事件抽取：方法、模态与未来展望的全面综述

美海军作战管理系统：变革战场空间的二十年

【MIT博士论文】以语言为中心的医学影像理解

俄罗斯“沙希德”/“天竺葵”攻击无人机

相关资讯

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning

Arxiv

0+阅读 · 2023年1月13日

A fully Bayesian sparse polynomial chaos expansion approach with joint priors on the coefficients and global selection of terms

Arxiv

0+阅读 · 2023年1月13日

Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data

Arxiv

0+阅读 · 2023年1月13日

Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle

Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle

Arxiv

0+阅读 · 2023年1月12日

A Polynomial-time, Truthful, Individually Rational and Budget Balanced Ridesharing Mechanism

Arxiv

0+阅读 · 2023年1月11日

相关基金

高速列车运行条件下轮对轴承的故障行为分析与表征方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

非线性脉冲随机系统的有限时间稳定、噪声镇定与不连续控制

国家自然科学基金

0+阅读 · 2013年12月31日

以石墨烯构建二维水通道正渗透膜与内浓差极化消除机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于正交投影迭代学习的高频响直线伺服系统重复性扰动辨识研究

国家自然科学基金

0+阅读 · 2013年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员