The Effect of Sample Size and Missingness on Inference with Missing Data - Zhuanzhi Papers (专知论文)

Inference · Frequentist school · Almost sure convergence · Sample · Almost surely

December 17, 2021

The Effect of Sample Size and Missingness on Inference with Missing Data

Julian Morimoto

From arXiv. Submitted to the Journal of the American Statistical Association on December 14, 2021.

When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This paper answers this question by offering a new theory about inference with missing data. It proves that as the sample size increases and the extent of missingness decreases, the mean-loglikelihood function generated by partial data and that ignores the missingness mechanism will almost surely converge uniformly to that which would have been generated by complete data; and if the data are Missing at Random, this convergence depends only on sample size. Thus, inferences on partial data, such as posterior modes, uncertainty estimates, confidence intervals, likelihood ratios, and indeed, all quantities or features derived from the partial-data loglikelihood function, will approximate their true values (what they would have been given complete data). This adds to previous research which has only proved the consistency of the posterior mode. Practical implications of this result are discussed, and the theory is tested on a previous study of International Human Rights Law.
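The convergence claim in the abstract can be illustrated with a small simulation. The following is only a minimal sketch under assumptions that are not taken from the paper: a unit-variance Gaussian location model, values dropped completely at random (a special case of Missing at Random), and a finite grid of `mu` values standing in for the supremum over the parameter space. The function names `mean_loglik`, `sup_gap`, and `average_gap` are hypothetical, and the paper's exact definitions of the partial-data loglikelihood may differ.

```python
import math
import random


def mean_loglik(xs, mu):
    """Mean Gaussian log-likelihood (unit variance) of xs at location mu."""
    n = len(xs)
    return -0.5 * math.log(2 * math.pi) - sum((x - mu) ** 2 for x in xs) / (2 * n)


def sup_gap(n, p_missing, rng, grid):
    """Largest |partial-data minus complete-data mean log-likelihood| over the grid."""
    full = [rng.gauss(1.0, 1.0) for _ in range(n)]
    # Assumed mechanism: each value is dropped independently of the data (MCAR).
    observed = [x for x in full if rng.random() >= p_missing]
    if not observed:
        raise ValueError("every value was dropped; nothing to evaluate")
    return max(abs(mean_loglik(observed, mu) - mean_loglik(full, mu)) for mu in grid)


def average_gap(n, p_missing, reps=10, seed=0):
    """Average the grid-sup gap over several simulated data sets."""
    rng = random.Random(seed)
    grid = [i / 5 for i in range(-10, 21)]  # mu from -2.0 to 4.0 in steps of 0.2
    return sum(sup_gap(n, p_missing, rng, grid) for _ in range(reps)) / reps


if __name__ == "__main__":
    # With a fixed missingness rate, the gap should shrink as n grows.
    for n in (50, 500, 2000):
        print(n, average_gap(n, p_missing=0.3))
```

Running the script prints the averaged gap for each sample size. With the missingness rate held fixed, the gap is expected to shrink as `n` grows, which matches the abstract's statement that under Missing at Random the convergence depends only on sample size.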
