In the infinite-armed bandit problem, each arm's average reward is sampled from an unknown distribution, and each arm can be sampled repeatedly to obtain noisy estimates of its average reward. Prior work focuses on identifying the best arm, i.e., estimating the maximum of the average reward distribution. We consider a general class of distribution functionals beyond the maximum, and propose unified meta-algorithms for both the offline and online settings, achieving optimal sample complexities. We show that online estimation, where the learner can sequentially choose whether to sample a new or existing arm, offers no advantage over the offline setting for estimating the mean functional, but significantly reduces the sample complexity for other functionals such as the median, maximum, and trimmed mean. The matching lower bounds utilize several different Wasserstein distances. For the special case of median estimation, we identify a curious thresholding phenomenon in the indistinguishability of Gaussian convolutions as the noise level varies, which may be of independent interest.
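For concreteness, the sampling model described above admits the following formalization; the notation here is an illustrative choice, not taken verbatim from the paper. Arm means are drawn i.i.d. from an unknown reward distribution $F$, and each pull of an arm returns its mean corrupted by independent noise (Gaussian noise of level $\sigma$ is the canonical assumption, consistent with the Gaussian convolutions mentioned above):
\[
\theta_1, \theta_2, \dots \overset{\text{i.i.d.}}{\sim} F, \qquad X_{i,j} = \theta_i + \varepsilon_{i,j}, \quad \varepsilon_{i,j} \sim \mathcal{N}(0, \sigma^2),
\]
where $X_{i,j}$ denotes the $j$-th pull of arm $i$. The goal is to estimate a functional $T(F)$, e.g., the mean $\mathbb{E}_{\theta \sim F}[\theta]$, the median of $F$, or its maximum (essential supremum), to within a target accuracy using as few total pulls as possible. In the offline setting, the number of arms and the number of pulls per arm are fixed in advance; in the online setting, the learner decides after each observation whether to pull an existing arm again or draw a fresh arm from $F$.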