It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish stable causal relationships from unstable spurious correlations across such shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized experiment to train a model that is robust to feature shifts and therefore transfers to a new target distribution. Theoretically, we justify the robustness of the approach against feature shifts using insights from causal learning. Empirically, we evaluate the CTRF in both synthetic-data experiments and real-world experiments on the Bing Ads platform, including a click-prediction task and an end-to-end counterfactual optimization system. The proposed CTRF produces robust predictions and outperforms most of the compared baseline methods in the presence of feature shifts.
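The core idea, combining a large logged dataset (where a spurious feature correlates with the label) with a small randomized sample (where it does not), can be illustrated with a loose sketch. This is not the CTRF algorithm itself, which modifies the tree-splitting procedure; here we merely approximate the intuition by pooling both datasets and upweighting the randomized sample, using a standard random forest. All dataset shapes, feature definitions, and weights below are illustrative assumptions.

```python
# Illustrative sketch only, NOT the authors' CTRF algorithm: we approximate
# the idea of combining logged data with a small upweighted randomized sample.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

def make_data(n, spurious):
    """x1 is a stable causal feature; x2 is spurious in logged data only."""
    y = rng.integers(0, 2, n)
    x1 = y + rng.normal(0.0, 0.5, n)       # causal signal, survives the shift
    if spurious:
        x2 = y + rng.normal(0.0, 0.1, n)   # tracks y tightly in logged data
    else:
        x2 = rng.normal(0.5, 1.0, n)       # independent of y after the shift
    return np.column_stack([x1, x2]), y

X_log, y_log = make_data(2000, spurious=True)     # large logged dataset
X_rand, y_rand = make_data(200, spurious=False)   # small randomized sample
X_test, y_test = make_data(2000, spurious=False)  # shifted target distribution

# Baseline: a forest trained on logged data alone latches onto x2
# and degrades when x2's correlation with y disappears at test time.
rf_log = RandomForestClassifier(n_estimators=100, random_state=0)
rf_log.fit(X_log, y_log)
acc_logged = rf_log.score(X_test, y_test)

# Combined: pool both datasets, upweighting the randomized sample so that
# splitting on the spurious x2 no longer pays off during training.
X_all = np.vstack([X_log, X_rand])
y_all = np.concatenate([y_log, y_rand])
w_all = np.concatenate([np.ones(len(y_log)), 20.0 * np.ones(len(y_rand))])
rf_comb = RandomForestClassifier(n_estimators=100, random_state=0)
rf_comb.fit(X_all, y_all, sample_weight=w_all)
acc_combined = rf_comb.score(X_test, y_test)
```

With a fixed seed, the combined model recovers much of the accuracy lost to the feature shift, while the logged-only model remains close to chance because it relies on the spurious feature.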