与错误数据的因果关系:计量错误、缺失值、差异化和差异隐私 (Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy) - 专知论文

会员服务 ·

0

推断 · 近似 · Analysis · 离散化 · 统计量 ·

2022 年 11 月 9 日

Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy

翻译：与错误数据的因果关系:计量错误、缺失值、差异化和差异隐私

Anish Agarwal,Rahul Singh

The US Census Bureau will deliberately corrupt data sets derived from the 2020 US Census in an effort to maintain privacy, suggesting a painful trade-off between the privacy of respondents and the precision of economic analysis. To investigate whether this trade-off is inevitable, we formulate a semiparametric model of causal inference with high dimensional corrupted data. We propose a procedure for data cleaning, estimation, and inference with data cleaning-adjusted confidence intervals. We prove consistency, Gaussian approximation, and semiparametric efficiency by finite sample arguments, with a rate of $n^{-1/2}$ for semiparametric estimands that degrades gracefully for nonparametric estimands. Our key assumption is that the true covariates are approximately low rank, which we interpret as approximate repeated measurements and validate in the Census. In our analysis, we provide nonasymptotic theoretical contributions to matrix completion, statistical learning, and semiparametric statistics. Calibrated simulations verify the coverage of our data cleaning-adjusted confidence intervals and demonstrate the relevance of our results for 2020 Census data.

翻译：美国人口普查局将故意腐蚀来自2020年美国人口普查的数据集,以维护隐私,暗示在答复者的隐私和经济分析精确度之间进行痛苦的权衡。为了调查这种权衡是否不可避免,我们用高度腐蚀的数据来制定一个因果推断的半参数模型。我们提出了一个数据清理、估计和推断的程序,用数据清理调整信任间隔来进行计算。我们通过有限的抽样参数来证明一致性、高斯近似值和半参数效率,对非参数估计值而言,半参数估计值降幅为1/2美元。我们的主要假设是,真正的共变体的等级几乎很低,我们在人口普查中将其解释为大约重复测量和验证。我们在分析中为矩阵的完成、统计学习和半参数统计提供了非抽象的理论贡献。经过校准的模拟核查了我们数据清理调整信任间隔的覆盖面,并展示了我们结果对2020年人口普查数据的关联性。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

BZT-BCT基复合薄膜磁电存储单元及其电致磁电效应研究

国家自然科学基金

0+阅读 · 2015年12月31日

受体酪氨酸激酶Tie2在血管生成与稳态维持中的调节机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

SF3B1基因调节Bcl-x可变剪接参与骨髓增生异常综合征-RARS红系无效造血的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

HIV-1 Tat蛋白诱发心肌间质纤维化促致死性心律失常

国家自然科学基金

0+阅读 · 2012年12月31日

城市大气颗粒物重金属污染特征及健康风险评估

国家自然科学基金

0+阅读 · 2012年12月31日

低氧对大鼠EIMD肌纤维膜损伤的影响机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

冬季燃煤取暖对城市地表灰尘重金属的影响及健康风险研究- - 以贵阳市为例

国家自然科学基金

0+阅读 · 2011年12月31日

血管外膜脂肪组织对动脉粥样硬化斑块稳定性的影响及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

A time domain a posteriori error bound for balancing-related model order reduction

Arxiv

0+阅读 · 2023年1月3日

Modular and Incremental Global Model Management with Extended Generalized Discrimination Networks

Arxiv

0+阅读 · 2023年1月2日

High-dimensional latent Gaussian count time series: Concentration results for autocovariances and applications

Arxiv

0+阅读 · 2023年1月2日

On High dimensional Poisson models with measurement error: hypothesis testing for nonlinear nonconvex optimization

Arxiv

0+阅读 · 2022年12月31日

Separating Computational and Statistical Differential Privacy (Under Plausible Assumptions)

Arxiv

0+阅读 · 2022年12月31日

A Cross-Validated Targeted Maximum Likelihood Estimator for Data-Adaptive Experiment Selection Applied to the Augmentation of RCT Control Arms with External Data

Arxiv

0+阅读 · 2022年12月29日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

Causality and Generalizability: Identifiability and Learning Methods

Arxiv

12+阅读 · 2021年10月4日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

A time domain a posteriori error bound for balancing-related model order reduction

Arxiv

0+阅读 · 2023年1月3日

Modular and Incremental Global Model Management with Extended Generalized Discrimination Networks

Arxiv

0+阅读 · 2023年1月2日

High-dimensional latent Gaussian count time series: Concentration results for autocovariances and applications

Arxiv

0+阅读 · 2023年1月2日

On High dimensional Poisson models with measurement error: hypothesis testing for nonlinear nonconvex optimization

Arxiv

0+阅读 · 2022年12月31日

Separating Computational and Statistical Differential Privacy (Under Plausible Assumptions)

Arxiv

0+阅读 · 2022年12月31日

A Cross-Validated Targeted Maximum Likelihood Estimator for Data-Adaptive Experiment Selection Applied to the Augmentation of RCT Control Arms with External Data

Arxiv

0+阅读 · 2022年12月29日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

Causality and Generalizability: Identifiability and Learning Methods

Arxiv

12+阅读 · 2021年10月4日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

相关基金

BZT-BCT基复合薄膜磁电存储单元及其电致磁电效应研究

国家自然科学基金

0+阅读 · 2015年12月31日

受体酪氨酸激酶Tie2在血管生成与稳态维持中的调节机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

SF3B1基因调节Bcl-x可变剪接参与骨髓增生异常综合征-RARS红系无效造血的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

HIV-1 Tat蛋白诱发心肌间质纤维化促致死性心律失常

国家自然科学基金

0+阅读 · 2012年12月31日

城市大气颗粒物重金属污染特征及健康风险评估

国家自然科学基金

0+阅读 · 2012年12月31日

低氧对大鼠EIMD肌纤维膜损伤的影响机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

冬季燃煤取暖对城市地表灰尘重金属的影响及健康风险研究- - 以贵阳市为例

国家自然科学基金

0+阅读 · 2011年12月31日

血管外膜脂肪组织对动脉粥样硬化斑块稳定性的影响及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员