使用 Causal 强盗对调适式利用 d- 分隔器 (Adaptively Exploiting d-Separators with Causal Bandits) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 观测变量 · 知识 (knowledge) · 优化器 · Performer ·

2022 年 5 月 24 日

Adaptively Exploiting d-Separators with Causal Bandits

翻译：使用 Causal 强盗对调适式利用 d- 分隔器

Blair Bilodeau,Linbo Wang,Daniel M. Roy

from arxiv, 33 pages, 3 figures

Multi-armed bandit problems provide a framework to identify the optimal intervention over a sequence of repeated experiments. Without additional assumptions, minimax optimal performance (measured by cumulative regret) is well-understood. With access to additional observed variables that d-separate the intervention from the outcome (i.e., they are a d-separator), recent "causal bandit" algorithms provably incur less regret. However, in practice it is desirable to be agnostic to whether observed variables are a d-separator. Ideally, an algorithm should be adaptive; that is, perform nearly as well as an algorithm with oracle knowledge of the presence or absence of a d-separator. In this work, we formalize and study this notion of adaptivity, and provide a novel algorithm that simultaneously achieves (a) optimal regret when a d-separator is observed, improving on classical minimax algorithms, and (b) significantly smaller regret than recent causal bandit algorithms when the observed variables are not a d-separator. Crucially, our algorithm does not require any oracle knowledge of whether a d-separator is observed. We also generalize this adaptivity to other conditions, such as the front-door criterion.

翻译：多武装土匪问题提供了一个框架,用以确定对一系列重复实验的最佳干预。没有额外的假设, 小型最佳性能( 以累积的遗憾衡量) 是完全理解的。通过访问额外的观测变量, d 将干预与结果分离( 即它们是一个分离器), 最近的“ causal 土匪” 算法可能会产生较少的遗憾。但是, 在实际中, 最好是对观测到的变量是否为 d- 分离器持谨慎态度。理想的情况是, 一种算法应该是适应性的; 也就是说, 一种接近于对 d- 分离器的存在或缺失有某种了解的算法。在这项工作中, 我们正式确定和研究这一适应性概念, 并提供一种新的算法, 既能(a) 观察到 d- 分离器, 也能够改善经典的迷你算法, 并且 (b) 当观察到的变量不是分隔器时, 要比最近的因果算法要小得多。关键是, 我们的算法并不要求任何对一般标准进行调整, 。

0

相关内容

赌博机/老虎机

赌博机/老虎机

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

MicroRNA调控BACE1在AD发病中的作用与机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

共激活蛋白在视网膜感光细胞发育中的分子调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

稳定度条件与环的正则性、clean性

国家自然科学基金

0+阅读 · 2012年12月31日

CRMP2对MCAO大鼠的神经保护作用

国家自然科学基金

0+阅读 · 2012年12月31日

Hint1与Girdin/Akt及Src信号通路串话在肝癌细胞增殖中的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

非倍测度函数空间上的一些问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于化学反应飞秒相干控制的飞秒时间分辨相干Raman光谱仪的研制

国家自然科学基金

0+阅读 · 2011年12月31日

RNAs激活前列腺癌靶基因表达的机制及其与miRNA关系的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

Arxiv

0+阅读 · 2022年7月13日

Contextual Bandits with Large Action Spaces: Made Practical

Arxiv

0+阅读 · 2022年7月12日

Differentially Private Linear Bandits with Partial Distributed Feedback

Arxiv

0+阅读 · 2022年7月12日

Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold Reduction

Arxiv

0+阅读 · 2022年7月12日

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

Arxiv

0+阅读 · 2022年7月11日

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

Arxiv

0+阅读 · 2022年7月10日

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond

Arxiv

0+阅读 · 2022年7月7日

Emergence of Novelty in Evolutionary Algorithms

Arxiv

0+阅读 · 2022年6月27日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

VIP会员

文章信息

相关主题

赌博机/老虎机

知识 (knowledge)

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

机器人领域中最佳的三维场景表示是什么？——从几何表示到基础模型

《多域作战兵棋推演：运用形态学分析与人工智能加强国防人员训练》

【博士论文】快速高效的归一化流及其在图像生成模型中的应用

仿生机器人技术的军事应用

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

Arxiv

0+阅读 · 2022年7月13日

Contextual Bandits with Large Action Spaces: Made Practical

Arxiv

0+阅读 · 2022年7月12日

Differentially Private Linear Bandits with Partial Distributed Feedback

Arxiv

0+阅读 · 2022年7月12日

Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold Reduction

Arxiv

0+阅读 · 2022年7月12日

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

Arxiv

0+阅读 · 2022年7月11日

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

Arxiv

0+阅读 · 2022年7月10日

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond

Arxiv

0+阅读 · 2022年7月7日

Emergence of Novelty in Evolutionary Algorithms

Arxiv

0+阅读 · 2022年6月27日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

相关基金

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

MicroRNA调控BACE1在AD发病中的作用与机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

共激活蛋白在视网膜感光细胞发育中的分子调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

稳定度条件与环的正则性、clean性

国家自然科学基金

0+阅读 · 2012年12月31日

CRMP2对MCAO大鼠的神经保护作用

国家自然科学基金

0+阅读 · 2012年12月31日

Hint1与Girdin/Akt及Src信号通路串话在肝癌细胞增殖中的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

非倍测度函数空间上的一些问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于化学反应飞秒相干控制的飞秒时间分辨相干Raman光谱仪的研制

国家自然科学基金

0+阅读 · 2011年12月31日

RNAs激活前列腺癌靶基因表达的机制及其与miRNA关系的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员