普通图形上的因果强盗 (Causal Bandits on General Graphs) - 专知论文

会员服务 ·

0

赌博机/老虎机 · SimPLe · 图 · Atom（文本编辑器） · 贝叶斯网/贝叶斯网络 ·

2021 年 7 月 6 日

Causal Bandits on General Graphs

翻译：普通图形上的因果强盗

Aurghya Maiti,Vineet Nair,Gaurav Sinha

from arxiv, 35 pages

We study the problem of determining the best intervention in a Causal Bayesian Network (CBN) specified only by its causal graph. We model this as a stochastic multi-armed bandit (MAB) problem with side-information, where the interventions correspond to the arms of the bandit instance. First, we propose a simple regret minimization algorithm that takes as input a semi-Markovian causal graph with atomic interventions and possibly unobservable variables, and achieves $\tilde{O}(\sqrt{M/T})$ expected simple regret, where $M$ is dependent on the input CBN and could be very small compared to the number of arms. We also show that this is almost optimal for CBNs described by causal graphs having an $n$-ary tree structure. Our simple regret minimization results, both upper and lower bound, subsume previous results in the literature, which assumed additional structural restrictions on the input causal graph. In particular, our results indicate that the simple regret guarantee of our proposed algorithm can only be improved by considering more nuanced structural restrictions on the causal graph. Next, we propose a cumulative regret minimization algorithm that takes as input a general causal graph with all observable nodes and atomic interventions and performs better than the optimal MAB algorithm that does not take causal side-information into account. We also experimentally compare both our algorithms with the best known algorithms in the literature. To the best of our knowledge, this work gives the first simple and cumulative regret minimization algorithms for CBNs with general causal graphs under atomic interventions and having unobserved confounders.

翻译：我们研究在Causal Bayesian网络(CBN)中确定最佳干预方法的问题,只是其因果图所指明的。我们用侧信息来模拟这个问题,因为干预方法与土匪的手臂相对应。首先,我们提出一个简单的遗憾最小化算法,将半马尔科瓦因果图和原子干预和可能无法观察的变量作为投入,并实现美元(sqrt{O})(sqrt{M/T})预期的简单遗憾,即$M$依赖于输入的CBN,与武器数量相比,其累积性可能非常小。我们还表明,对于带有美元树结构的因果图所描述的CBBNs几乎是最佳的。我们简单的遗憾最小化算算法,它假定对输入的因果性图表有额外的结构性限制。特别是,我们提议的算法的简单遗憾保证只能通过首先考虑对因果图的侧面结构限制来改进,而与武器数量相比可能非常小。我们还表明,对于C的累积性图表来说,这几乎都是最佳的因果算算法,我们最坏的。

0

相关内容

赌博机/老虎机

赌博机/老虎机

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

【KDD2020】现实世界超图的结构模式和生成模型，Structural Patterns and Generative Models of Real-world Hypergraphs

【KDD2020】现实世界超图的结构模式和生成模型，Structural Patterns and Generative Models of Real-world Hypergraphs

专知会员服务

37+阅读 · 2020年6月16日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【ICML2019 tutorial】因果推理和稳定学习（Causal Inference and Stable Learning）

【ICML2019 tutorial】因果推理和稳定学习（Causal Inference and Stable Learning）

专知会员服务

176+阅读 · 2019年12月7日

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

专知会员服务

7+阅读 · 2019年11月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【IJCAI 2019 Tutorials】基于概率图模型的医疗决策分析（Medical decision analysis with probabilistic graphical models）

【IJCAI 2019 Tutorials】基于概率图模型的医疗决策分析（Medical decision analysis with probabilistic graphical models）

专知会员服务

46+阅读 · 2019年8月10日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

逆强化学习几篇论文笔记

逆强化学习几篇论文笔记

CreateAMind

9+阅读 · 2018年12月13日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

强化学习初探 - 从多臂老虎机问题说起

强化学习初探 - 从多臂老虎机问题说起

专知

10+阅读 · 2018年4月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Dependent Dirichlet Processes for Analysis of a Generalized Shared Frailty Model

Arxiv

0+阅读 · 2021年9月9日

Parameterizing and Simulating from Causal Models

Parameterizing and Simulating from Causal Models

Arxiv

0+阅读 · 2021年9月8日

Inference and model selection in general causal time series with exogenous covariates

Arxiv

0+阅读 · 2021年9月5日

Community Detection in General Hypergraph via Graph Embedding

Community Detection in General Hypergraph via Graph Embedding

Arxiv

1+阅读 · 2021年9月3日

Generative Graph Convolutional Network for Growing Graphs

Generative Graph Convolutional Network for Growing Graphs

Arxiv

3+阅读 · 2019年3月6日

Lipschitz Generative Adversarial Nets

Arxiv

8+阅读 · 2019年2月15日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

Inductive Representation Learning on Large Graphs

Arxiv

3+阅读 · 2018年4月10日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

Neural Response Generation with Dynamic Vocabularies

Arxiv

5+阅读 · 2017年11月30日

VIP会员

文章信息

相关主题

赌博机/老虎机

Atom（文本编辑器）

贝叶斯网/贝叶斯网络

相关VIP内容

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

【KDD2020】现实世界超图的结构模式和生成模型，Structural Patterns and Generative Models of Real-world Hypergraphs

【KDD2020】现实世界超图的结构模式和生成模型，Structural Patterns and Generative Models of Real-world Hypergraphs

专知会员服务

37+阅读 · 2020年6月16日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【ICML2019 tutorial】因果推理和稳定学习（Causal Inference and Stable Learning）

【ICML2019 tutorial】因果推理和稳定学习（Causal Inference and Stable Learning）

专知会员服务

176+阅读 · 2019年12月7日

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

专知会员服务

7+阅读 · 2019年11月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【IJCAI 2019 Tutorials】基于概率图模型的医疗决策分析（Medical decision analysis with probabilistic graphical models）

【IJCAI 2019 Tutorials】基于概率图模型的医疗决策分析（Medical decision analysis with probabilistic graphical models）

专知会员服务

46+阅读 · 2019年8月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

逆强化学习几篇论文笔记

逆强化学习几篇论文笔记

CreateAMind

9+阅读 · 2018年12月13日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

强化学习初探 - 从多臂老虎机问题说起

强化学习初探 - 从多臂老虎机问题说起

专知

10+阅读 · 2018年4月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Dependent Dirichlet Processes for Analysis of a Generalized Shared Frailty Model

Arxiv

0+阅读 · 2021年9月9日

Parameterizing and Simulating from Causal Models

Parameterizing and Simulating from Causal Models

Arxiv

0+阅读 · 2021年9月8日

Inference and model selection in general causal time series with exogenous covariates

Arxiv

0+阅读 · 2021年9月5日

Community Detection in General Hypergraph via Graph Embedding

Community Detection in General Hypergraph via Graph Embedding

Arxiv

1+阅读 · 2021年9月3日

Generative Graph Convolutional Network for Growing Graphs

Generative Graph Convolutional Network for Growing Graphs

Arxiv

3+阅读 · 2019年3月6日

Lipschitz Generative Adversarial Nets

Arxiv

8+阅读 · 2019年2月15日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

Inductive Representation Learning on Large Graphs

Arxiv

3+阅读 · 2018年4月10日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

Neural Response Generation with Dynamic Vocabularies

Arxiv

5+阅读 · 2017年11月30日

微信扫码咨询专知VIP会员