对不同成因因果关系估计模型选选模式的经验分析 (Empirical Analysis of Model Selection for Heterogenous Causal Effect Estimation) - 专知论文

会员服务 ·

0

模型选择 · 估计/估计量 · MoDELS · Analysis · 数据集 ·

2022 年 11 月 3 日

Empirical Analysis of Model Selection for Heterogenous Causal Effect Estimation

翻译：对不同成因因果关系估计模型选选模式的经验分析

Divyat Mahajan,Ioannis Mitliagkas,Brady Neal,Vasilis Syrgkanis

from arxiv, Preprint. Under Review

We study the problem of model selection in causal inference, specifically for the case of conditional average treatment effect (CATE) estimation under binary treatments. Unlike model selection in machine learning, we cannot use the technique of cross-validation here as we do not observe the counterfactual potential outcome for any data point. Hence, we need to design model selection techniques that do not explicitly rely on counterfactual data. As an alternative to cross-validation, there have been a variety of proxy metrics proposed in the literature, that depend on auxiliary nuisance models also estimated from the data (propensity score model, outcome regression model). However, the effectiveness of these metrics has only been studied on synthetic datasets as we can observe the counterfactual data for them. We conduct an extensive empirical analysis to judge the performance of these metrics, where we utilize the latest advances in generative modeling to incorporate multiple realistic datasets. We evaluate 9 metrics on 144 datasets for selecting between 415 estimators per dataset, including datasets that closely mimic real-world datasets. Further, we use the latest techniques from AutoML to ensure consistent hyperparameter selection for nuisance models for a fair comparison across metrics.

翻译：我们研究因果推断中的模型选择问题,具体针对在二进制处理中进行有条件平均治疗效果(CATE)估计的情况。与机器学习中的模型选择不同,我们在这里不能使用交叉校验技术,因为我们没有观察到任何数据点的反事实潜在结果。因此,我们需要设计不明显依赖反事实数据的模型选择技术。作为交叉校验的替代办法,我们用文献中建议的各种代用指标来研究,这取决于从数据中估计的辅助性骚扰模型(对称分评分模型、结果回归模型)。然而,这些指标的有效性仅研究在合成数据集上,因为我们可以观察反事实数据。我们进行了广泛的实证分析,以判断这些指标的性能,我们利用基因化模型的最新进展来纳入多种现实的数据集。我们评估了144个数据集的9项指标,用于选择415个估量器,包括近似真实世界数据集。此外,我们使用最新的标准来进行自自动模模的比较,以确保从自动模量模型到自自动模准的一致的比标。

0

相关内容

模型选择

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

活性氧介导的内质网应激在博莱霉素诱发肺上皮-间质转化和肺纤维化中的作用

国家自然科学基金

0+阅读 · 2016年12月31日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

基于PERK/elF2α通路研究针刺调控MCAO/R大鼠内质网应激-自噬稳态重构的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA-VEC1340靶定KLF4在血管内皮细胞损伤中的调控及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向智能电网基础设施Cyber-Physical安全的自治愈基础理论研究

国家自然科学基金

1+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

基于LTCC的多重折叠基片集成波导滤波器的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calreticulin-STAT3/PKC信号通路介导的线粒体损伤在扩张型心肌病发病中的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

汽车底盘一体化控制的仿驾驶员智能控制机理及关键技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

二苯乙烯苷对氧化应激诱导的内皮细胞凋亡的影响

国家自然科学基金

0+阅读 · 2008年12月31日

Copula Graphical Models for Heterogeneous Mixed Data

Arxiv

0+阅读 · 2022年12月22日

kalis: A Modern Implementation of the Li & Stephens Model for Local Ancestry Inference in R

Arxiv

0+阅读 · 2022年12月21日

Efficient Nonparametric Estimation of Incremental Propensity Score Effects with Clustered Interference

Arxiv

0+阅读 · 2022年12月21日

On the Relationship Between Explanation and Prediction: A Causal View

Arxiv

0+阅读 · 2022年12月20日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Locally Weighted Regression with different Kernel Smoothers for Software Effort Estimation

Arxiv

0+阅读 · 2022年9月12日

A Conditional-Probability-Distribution Model for Bandwidth Estimation with Application in Live Video Streaming

Arxiv

0+阅读 · 2022年4月16日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Copula Graphical Models for Heterogeneous Mixed Data

Arxiv

0+阅读 · 2022年12月22日

kalis: A Modern Implementation of the Li & Stephens Model for Local Ancestry Inference in R

Arxiv

0+阅读 · 2022年12月21日

Efficient Nonparametric Estimation of Incremental Propensity Score Effects with Clustered Interference

Arxiv

0+阅读 · 2022年12月21日

On the Relationship Between Explanation and Prediction: A Causal View

Arxiv

0+阅读 · 2022年12月20日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Locally Weighted Regression with different Kernel Smoothers for Software Effort Estimation

Arxiv

0+阅读 · 2022年9月12日

A Conditional-Probability-Distribution Model for Bandwidth Estimation with Application in Live Video Streaming

Arxiv

0+阅读 · 2022年4月16日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

相关基金

活性氧介导的内质网应激在博莱霉素诱发肺上皮-间质转化和肺纤维化中的作用

国家自然科学基金

0+阅读 · 2016年12月31日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

基于PERK/elF2α通路研究针刺调控MCAO/R大鼠内质网应激-自噬稳态重构的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA-VEC1340靶定KLF4在血管内皮细胞损伤中的调控及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向智能电网基础设施Cyber-Physical安全的自治愈基础理论研究

国家自然科学基金

1+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

基于LTCC的多重折叠基片集成波导滤波器的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calreticulin-STAT3/PKC信号通路介导的线粒体损伤在扩张型心肌病发病中的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

汽车底盘一体化控制的仿驾驶员智能控制机理及关键技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

二苯乙烯苷对氧化应激诱导的内皮细胞凋亡的影响

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员