Causal 强盗组合理论勘探 (Combinatorial Pure Exploration of Causal Bandits) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 图 · 样本 · 广义线性模型 · MoDELS ·

2023 年 1 月 10 日

Combinatorial Pure Exploration of Causal Bandits

翻译：Causal 强盗组合理论勘探

Nuoya Xiong,Wei Chen

The combinatorial pure exploration of causal bandits is the following online learning task: given a causal graph with unknown causal inference distributions, in each round we choose a subset of variables to intervene or do no intervention, and observe the random outcomes of all random variables, with the goal that using as few rounds as possible, we can output an intervention that gives the best (or almost best) expected outcome on the reward variable $Y$ with probability at least $1-\delta$, where $\delta$ is a given confidence level. We provide the first gap-dependent and fully adaptive pure exploration algorithms on two types of causal models -- the binary generalized linear model (BGLM) and general graphs. For BGLM, our algorithm is the first to be designed specifically for this setting and achieves polynomial sample complexity, while all existing algorithms for general graphs have either sample complexity exponential to the graph size or some unreasonable assumptions. For general graphs, our algorithm provides a significant improvement on sample complexity, and it nearly matches the lower bound we prove. Our algorithms achieve such improvement by a novel integration of prior causal bandit algorithms and prior adaptive pure exploration algorithms, the former of which utilize the rich observational feedback in causal bandits but are not adaptive to reward gaps, while the latter of which have the issue in reverse.

翻译：对因果强盗的分类纯粹探索是下述在线学习任务:在每回合中,我们选择一组变量来干预或不干预,观察所有随机变量的随机结果,目标是尽可能多地使用几轮,我们就能产生出一个最佳(或几乎最佳)的干预结果,使奖励变量的预期结果产生最佳(或最佳)Y$,概率至少为1美元或德尔塔元,其中美元为某种信任水平。我们为两种类型的因果模型提供了第一个基于差距和完全适应性的纯勘探算法,即二元通用线性模型(BGLM)和一般图表。对于BGLM来说,我们的算法是第一个专门设计用于这一设置并达到多数值样本复杂性的,而一般图表的所有现有算法要么样本复杂度至少为1美元或一些不合理的假设。对于一般图表来说,我们的算法提供了样本复杂性的重大改进,而且几乎与我们所证明的较低约束度相符。我们的算法通过对两种类型的因果关系模型进行新的整合,即以前的因果宽度直线模型(BGLMM)和一般图表,我们算算法是第一个专门设计为这一设置的组合组合,而先为这一设置是用于这一设置的,而后再采用前的因果性分析的反向导的,而后演算法,而后演算法则则则则则则则则是是先使用前的,而采用前的因果性分析,而采用前的变制反向后先采用前的,而采用前的变制。

0

相关内容

赌博机/老虎机

赌博机/老虎机

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

107+阅读 · 2021年8月27日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

γ-TiA1合金室温脆性的研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

直接甲醇燃料电池用聚苯类电解质膜的分子设计及降解机理

国家自然科学基金

0+阅读 · 2012年12月31日

硅量子点纳米结构薄膜材料及其太阳电池的制备研究

国家自然科学基金

0+阅读 · 2012年12月31日

强各向异性Be薄膜的晶粒细化和应力弛豫机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型低聚季铵盐的合成及其萃取特性与机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于FeGa材料的大位移磁致伸缩传感技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

高结晶度一维TiO2制备、表面修饰及三维微区扫描光伏研究

国家自然科学基金

0+阅读 · 2009年12月31日

遍历哈密顿系统的谱理论

国家自然科学基金

0+阅读 · 2009年12月31日

A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning

Arxiv

0+阅读 · 2023年3月7日

Margin theory for the scenario-based approach to robust optimization in high dimension

Arxiv

0+阅读 · 2023年3月7日

Sampling-based Exploration for Reinforcement Learning of Dexterous Manipulation

Arxiv

0+阅读 · 2023年3月6日

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

Arxiv

0+阅读 · 2023年3月6日

Motion-based extrinsic sensor-to-sensor calibration: Effect of reference frame selection for new and existing methods

Arxiv

0+阅读 · 2023年3月6日

Expectation consistency for calibration of neural networks

Arxiv

0+阅读 · 2023年3月5日

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

Arxiv

0+阅读 · 2023年3月4日

Graph-based Simultaneous Coverage and Exploration Planning for Fast Multi-robot Search

Arxiv

0+阅读 · 2023年3月3日

Causal Inference using Multivariate Generalized Linear Mixed-Effects Models with Longitudinal Data

Arxiv

0+阅读 · 2023年3月3日

Inference for Cluster Randomized Experiments with Non-ignorable Cluster Sizes

Arxiv

0+阅读 · 2023年3月3日

VIP会员

文章信息

相关主题

赌博机/老虎机

广义线性模型

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

107+阅读 · 2021年8月27日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning

Arxiv

0+阅读 · 2023年3月7日

Margin theory for the scenario-based approach to robust optimization in high dimension

Arxiv

0+阅读 · 2023年3月7日

Sampling-based Exploration for Reinforcement Learning of Dexterous Manipulation

Arxiv

0+阅读 · 2023年3月6日

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

Arxiv

0+阅读 · 2023年3月6日

Motion-based extrinsic sensor-to-sensor calibration: Effect of reference frame selection for new and existing methods

Arxiv

0+阅读 · 2023年3月6日

Expectation consistency for calibration of neural networks

Arxiv

0+阅读 · 2023年3月5日

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

Arxiv

0+阅读 · 2023年3月4日

Graph-based Simultaneous Coverage and Exploration Planning for Fast Multi-robot Search

Arxiv

0+阅读 · 2023年3月3日

Causal Inference using Multivariate Generalized Linear Mixed-Effects Models with Longitudinal Data

Arxiv

0+阅读 · 2023年3月3日

Inference for Cluster Randomized Experiments with Non-ignorable Cluster Sizes

Arxiv

0+阅读 · 2023年3月3日

相关基金

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

γ-TiA1合金室温脆性的研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

直接甲醇燃料电池用聚苯类电解质膜的分子设计及降解机理

国家自然科学基金

0+阅读 · 2012年12月31日

硅量子点纳米结构薄膜材料及其太阳电池的制备研究

国家自然科学基金

0+阅读 · 2012年12月31日

强各向异性Be薄膜的晶粒细化和应力弛豫机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型低聚季铵盐的合成及其萃取特性与机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于FeGa材料的大位移磁致伸缩传感技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

高结晶度一维TiO2制备、表面修饰及三维微区扫描光伏研究

国家自然科学基金

0+阅读 · 2009年12月31日

遍历哈密顿系统的谱理论

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员