Assessment of Treatment Effect Estimators for Heavy-Tailed Data - 专知 (Zhuanzhi) Papers

December 19, 2021

Assessment of Treatment Effect Estimators for Heavy-Tailed Data

Nilesh Tripuraneni, Dhruv Madeka, Dean Foster, Dominique Perrault-Joncas, Michael I. Jordan

A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance. In this paper, we provide a novel cross-validation-like methodology to address this challenge. The key insight of our procedure is that the noisy (but unbiased) difference-of-means estimate can be used as a ground truth "label" on a portion of the RCT, to test the performance of an estimator trained on the other portion. We combine this insight with an aggregation scheme, which borrows statistical strength across a large collection of RCTs, to present an end-to-end methodology for judging an estimator's ability to recover the underlying treatment effect. We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain. In the corpus of AB tests at Amazon, we highlight the unique difficulties associated with recovering the treatment effect due to the heavy-tailed nature of the response variables. In this heavy-tailed setting, our methodology suggests that procedures that aggressively downweight or truncate large values, while introducing bias, lower the variance enough to ensure that the treatment effect is more accurately estimated.
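The split-and-score idea in the abstract can be illustrated with a small sketch. It rests on one observation: if the held-out difference-of-means label is unbiased for the true effect and independent of the estimate, then the expected squared gap between estimate and label equals the estimator's mean squared error plus the label's variance. The label variance does not depend on the estimator, so ranking estimators by held-out loss ranks them by error against the unknown true effect. Everything below is an illustrative assumption rather than the paper's implementation: the function names, the 50/50 split, the upper-tail clipping rule, the squared-error loss, the plain average across RCTs, and the synthetic Pareto data are all hypothetical, and only the Python standard library is used.

```python
import random
import statistics


def diff_of_means(treated, control):
    """Unbiased but noisy treatment-effect estimate."""
    return statistics.fmean(treated) - statistics.fmean(control)


def clipped_diff_of_means(treated, control, q=0.99):
    """Hypothetical biased estimator: cap values at the pooled q-quantile."""
    pooled = sorted(treated + control)
    cap = pooled[int(q * (len(pooled) - 1))]
    capped_treated = [min(x, cap) for x in treated]
    capped_control = [min(x, cap) for x in control]
    return diff_of_means(capped_treated, capped_control)


def split_in_half(values, rng):
    """Randomly split one arm into a fitting half and a validation half."""
    shuffled = list(values)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


def held_out_loss(estimator, rcts, seed=0):
    """Average squared gap between the estimate and the held-out label.

    Using the same seed for every estimator gives identical splits, so the
    label noise is shared and only the estimators differ.
    """
    rng = random.Random(seed)
    losses = []
    for treated, control in rcts:
        t_fit, t_val = split_in_half(treated, rng)
        c_fit, c_val = split_in_half(control, rng)
        estimate = estimator(t_fit, c_fit)   # fitted on one portion
        label = diff_of_means(t_val, c_val)  # noisy ground truth on the other
        losses.append((estimate - label) ** 2)
    return statistics.fmean(losses)


if __name__ == "__main__":
    # Synthetic heavy-tailed RCTs (assumed Pareto responses, small additive effect).
    data_rng = random.Random(1)
    rcts = []
    for _ in range(200):
        control = [data_rng.paretovariate(1.5) for _ in range(400)]
        treated = [data_rng.paretovariate(1.5) + 0.1 for _ in range(400)]
        rcts.append((treated, control))
    print("difference of means:", held_out_loss(diff_of_means, rcts))
    print("clipped estimator:  ", held_out_loss(clipped_diff_of_means, rcts))
```

Averaging the loss over many RCTs is what makes the comparison usable: on any single experiment the held-out label is too noisy to separate two estimators, while the aggregate can reveal whether the variance removed by clipping outweighs the bias it introduces.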

