Adaptively Exploiting d-Separators with Causal Bandits


Multi-armed bandits · Observed variables · Minimax · Optimizer · Performer

October 26, 2022


Blair Bilodeau, Linbo Wang, Daniel M. Roy

From arXiv: 29 pages, 3 figures. Camera-ready version.

Multi-armed bandit problems provide a framework to identify the optimal intervention over a sequence of repeated experiments. Without additional assumptions, minimax optimal performance (measured by cumulative regret) is well-understood. With access to additional observed variables that d-separate the intervention from the outcome (i.e., they are a d-separator), recent "causal bandit" algorithms provably incur less regret. However, in practice it is desirable to be agnostic to whether observed variables are a d-separator. Ideally, an algorithm should be adaptive; that is, perform nearly as well as an algorithm with oracle knowledge of the presence or absence of a d-separator. In this work, we formalize and study this notion of adaptivity, and provide a novel algorithm that simultaneously achieves (a) optimal regret when a d-separator is observed, improving on classical minimax algorithms, and (b) significantly smaller regret than recent causal bandit algorithms when the observed variables are not a d-separator. Crucially, our algorithm does not require any oracle knowledge of whether a d-separator is observed. We also generalize this adaptivity to other conditions, such as the front-door criterion.
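To make "cumulative regret" concrete, the following is a minimal sketch of the classical UCB1 algorithm on Bernoulli arms. It illustrates only the baseline setting the abstract refers to (no additional observed variables, no d-separator); it is not the algorithm proposed in the paper. The function name `ucb1_regret`, the arm means, and the horizon are illustrative assumptions.

```python
import math
import random


def ucb1_regret(means, horizon, seed=0):
    """Run UCB1 on Bernoulli arms; return cumulative pseudo-regret per round.

    `means` holds the (hypothetical) true success probability of each arm.
    The learner never sees them; they are used only to simulate rewards
    and to score the regret of each choice.
    """
    rng = random.Random(seed)
    n_arms = len(means)
    counts = [0] * n_arms   # number of pulls per arm
    sums = [0.0] * n_arms   # total observed reward per arm
    best = max(means)
    regret, total = [], 0.0
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1     # pull every arm once to initialise estimates
        else:
            # Pick the arm with the largest upper confidence bound.
            arm = max(
                range(n_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += best - means[arm]  # expected loss relative to the best arm
        regret.append(total)
    return regret


if __name__ == "__main__":
    curve = ucb1_regret([0.2, 0.5, 0.8], horizon=2000)
    print(f"cumulative regret after {len(curve)} rounds: {curve[-1]:.1f}")
```

Each intervention is treated here as an independent arm, which is exactly the structure-agnostic setting; the causal bandit algorithms discussed in the abstract additionally use the observed variables to share information across arms.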


