使用分离装置,在事先不知情的情况下使用分离装置的因果强盗 (Causal Bandits without prior knowledge using separating sets) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 分离的 · 知识 (knowledge) · 估计/估计量 · 离散化 ·

2022 年 9 月 29 日

Causal Bandits without prior knowledge using separating sets

翻译：使用分离装置,在事先不知情的情况下使用分离装置的因果强盗

Arnoud A. W. M. de Kroon,Danielle Belgrave,Joris M. Mooij

The Causal Bandit is a variant of the classic Bandit problem where an agent must identify the best action in a sequential decision-making process, where the reward distribution of the actions displays a non-trivial dependence structure that is governed by a causal model. Methods proposed for this problem thus far in the literature rely on exact prior knowledge of the full causal graph. We formulate new causal bandit algorithms that no longer necessarily rely on prior causal knowledge. Instead, they utilize an estimator based on separating sets, which we can find using simple conditional independence tests or causal discovery methods. We show that, given a true separating set, for discrete i.i.d. data, this estimator is unbiased, and has variance which is upper bounded by that of the sample mean. We develop algorithms based on Thompson Sampling and UCB for discrete and Gaussian models respectively and show increased performance on simulation data as well as on a bandit drawing from real-world protein signaling data.

翻译：Causal Bandit是典型的土匪问题的变体,在这种变体中,代理人必须确定一个连续决策过程中的最佳行动,行动的奖赏分配显示的是非三角依赖结构,这种结构受因果模式的制约。迄今为止在文献中为这一问题提出的方法依赖于对全因果图的准确事先了解。我们制定了新的因果盗匪算法,这种算法不必再依赖先前的因果知识。相反,它们使用一个基于分离的测算器,我们可以使用简单的有条件独立测试或因果发现方法找到。我们显示,根据真实的分离数据集,对于离散的如.d.数据来说,这个测算器是不带偏见的,而且存在差异,受抽样平均值的比重。我们分别根据Thompson Sampling和UCB为离散和高斯模型制定算法,并显示模拟数据以及从真实世界蛋白质信号数据中提取的浮标的浮标的测算法的性提高。

0

相关内容

赌博机/老虎机

赌博机/老虎机

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

51+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

SIRT1介导的Resveratrol对糖尿病视网膜病变“代谢记忆”的作用及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

SirT1介导硫化氢对自发性高血压大鼠血管平滑肌细胞增殖的抑制效应

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

DNA硫化修饰的抗氧化机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

基于机器学习和融合算法的全球陆表植被覆盖度估算方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

ABCA1甲基化在动脉粥样硬化中的作用及miR-155靶向调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂最外层具有6s2电子结构的元素提升Ca5Al2Sb6基材料热电性能的理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

益气养阴活血中药对胰岛β细胞脂凋亡的作用

国家自然科学基金

0+阅读 · 2011年12月31日

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

Arxiv

0+阅读 · 2022年11月4日

Distributionally Robust Causal Inference with Observational Data

Arxiv

0+阅读 · 2022年11月4日

Information Design for Differential Privacy

Arxiv

0+阅读 · 2022年11月4日

Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework

Arxiv

0+阅读 · 2022年11月3日

Speed Up the Cold-Start Learning in Two-Sided Bandits with Many Arms

Arxiv

0+阅读 · 2022年11月3日

A Survey of Deep Causal Models

Arxiv

0+阅读 · 2022年11月3日

A Bayesian Semiparametric Method For Estimating Causal Quantile Effects

Arxiv

0+阅读 · 2022年11月3日

A coherence parameter characterizing generative compressed sensing with Fourier measurements

Arxiv

0+阅读 · 2022年11月3日

On the Interaction Between Differential Privacy and Gradient Compression in Deep Learning

Arxiv

0+阅读 · 2022年11月1日

Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits

Arxiv

0+阅读 · 2022年11月1日

VIP会员

文章信息

相关主题

赌博机/老虎机

知识 (knowledge)

估计/估计量

相关VIP内容

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

51+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

Arxiv

0+阅读 · 2022年11月4日

Distributionally Robust Causal Inference with Observational Data

Arxiv

0+阅读 · 2022年11月4日

Information Design for Differential Privacy

Arxiv

0+阅读 · 2022年11月4日

Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework

Arxiv

0+阅读 · 2022年11月3日

Speed Up the Cold-Start Learning in Two-Sided Bandits with Many Arms

Arxiv

0+阅读 · 2022年11月3日

A Survey of Deep Causal Models

Arxiv

0+阅读 · 2022年11月3日

A Bayesian Semiparametric Method For Estimating Causal Quantile Effects

Arxiv

0+阅读 · 2022年11月3日

A coherence parameter characterizing generative compressed sensing with Fourier measurements

Arxiv

0+阅读 · 2022年11月3日

On the Interaction Between Differential Privacy and Gradient Compression in Deep Learning

Arxiv

0+阅读 · 2022年11月1日

Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits

Arxiv

0+阅读 · 2022年11月1日

相关基金

SIRT1介导的Resveratrol对糖尿病视网膜病变“代谢记忆”的作用及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

SirT1介导硫化氢对自发性高血压大鼠血管平滑肌细胞增殖的抑制效应

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

DNA硫化修饰的抗氧化机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

基于机器学习和融合算法的全球陆表植被覆盖度估算方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

ABCA1甲基化在动脉粥样硬化中的作用及miR-155靶向调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂最外层具有6s2电子结构的元素提升Ca5Al2Sb6基材料热电性能的理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

益气养阴活血中药对胰岛β细胞脂凋亡的作用

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员