We study best-arm identification with a fixed budget and contextual (covariate) information in stochastic multi-armed bandit problems. In each round, after observing contextual information, we choose a treatment arm using past observations and the current context. Our goal is to identify the best treatment arm, the arm whose expected reward, marginalized over the contextual distribution, is maximal, with a minimal probability of misidentification. First, we derive semiparametric lower bounds on the misidentification probability for this problem, where we regard the gaps between the expected rewards of the best and suboptimal treatment arms as parameters of interest, and all other parameters, such as the expected rewards conditioned on contexts, as nuisance parameters. We then develop the ``Contextual RS-AIPW strategy,'' which consists of a random sampling (RS) rule that tracks a target allocation ratio and a recommendation rule based on the augmented inverse probability weighting (AIPW) estimator. Our proposed Contextual RS-AIPW strategy is asymptotically optimal: the upper bound on its probability of misidentification matches the semiparametric lower bound as the budget goes to infinity and the gaps converge to zero.
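The AIPW estimator mentioned above can be illustrated with a minimal sketch. For a single arm $a$, it combines a plug-in regression estimate of the conditional mean reward with an inverse-probability-weighted correction, so the estimate remains unbiased even when the regression is imperfect. Everything below (the function name `aipw_mean`, the simulated data, the oracle regression) is an illustrative assumption, not the paper's implementation:

```python
import numpy as np

def aipw_mean(rewards, pulled, pi, f_hat):
    """AIPW estimate of E[Y(a)] marginalized over contexts.

    rewards : observed rewards Y_t (only used in rounds where arm a was pulled)
    pulled  : indicator 1{A_t = a}
    pi      : probability the sampling rule pulled arm a given context X_t
    f_hat   : regression estimate of E[Y_t | A_t = a, X_t]
    """
    # IPW correction is zero in rounds where arm a was not pulled,
    # so unobserved rewards never enter the estimate.
    correction = pulled / pi * (rewards - f_hat)
    return np.mean(f_hat + correction)

# Toy simulation: one arm with marginal mean reward 1.0 and a linear
# dependence on a scalar context (all values chosen for illustration).
rng = np.random.default_rng(0)
T = 100_000
x = rng.normal(size=T)                    # contexts X_t
pi = np.full(T, 0.5)                      # RS rule: pull arm a w.p. 0.5
pulled = (rng.random(T) < pi).astype(float)
y = 1.0 + 0.3 * x + rng.normal(scale=0.5, size=T)
y = np.where(pulled == 1.0, y, 0.0)       # reward observed only when pulled
f_hat = 1.0 + 0.3 * x                     # oracle regression, for the sketch
est = aipw_mean(y, pulled, pi, f_hat)
print(est)
```

With this many rounds the estimate concentrates near the true marginal mean of 1.0; in the paper's strategy, the gaps between such per-arm estimates drive the final recommendation.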