估计分数分数 (Imputation Scores) - 专知论文

会员服务 ·

0

秩 · 预测准确率 · 得分 · 模型评估 · 值域 ·

2022 年 11 月 30 日

Imputation Scores

翻译：估计分数分数

Jeffrey Näf,Meta-Lina Spohn,Loris Michel,Nicolai Meinshausen

Given the prevalence of missing data in modern statistical research, a broad range of methods is available for any given imputation task. How does one choose the `best' imputation method in a given application? The standard approach is to select some observations, set their status to missing, and compare prediction accuracy of the methods under consideration of these observations. Besides having to somewhat artificially mask observations, a shortcoming of this approach is that imputations based on the conditional mean will rank highest if predictive accuracy is measured with quadratic loss. In contrast, we want to rank highest an imputation that can sample from the true conditional distributions. In this paper, we develop a framework called "Imputation Scores" (I-Scores) for assessing missing value imputations. We provide a specific I-Score based on density ratios and projections, that is applicable to discrete and continuous data. It does not require to mask additional observations for evaluations and is also applicable if there are no complete observations. The population version is shown to be proper in the sense that the highest rank is assigned to an imputation method that samples from the correct conditional distribution. The propriety is shown under the missing completely at random (MCAR) assumption but is also shown to be valid under missing at random (MAR) with slightly more restrictive assumptions. We show empirically on a range of data sets and imputation methods that our score consistently ranks true data high(est) and is able to avoid pitfalls usually associated with performance measures such as RMSE. Finally, we provide the R-package Iscores available on CRAN with an implementation of our method.

翻译：鉴于现代统计研究中缺少数据的普遍性,对于任何特定的估算任务,都有广泛的方法可供选择。在特定应用中,我们如何选择“最佳”估算方法?标准方法是选择一些观察,将其状况设定为缺失,比较审议这些观察的方法的预测准确性。除了需要某种人为地掩盖观察之外,这种方法的一个缺点是,基于有条件平均值的估算如果用四分位损失来衡量预测准确度,则其排名最高。相比之下,我们希望将能够从真实的有条件分布中抽样的估算方法排在最高的位置。在本文件中,我们为评估缺失的估算制定一个框架,称为“提高分”(I-Scorets) 。我们根据密度比率和预测提供具体的I-Score,这适用于离散和连续的数据。如果以四分位损失来衡量预测,则基于有条件的估算,基于条件的估算值的估算值将排在最高的位置上。(人口版本表明,最高等级被指定为从真实的有条件分布中抽样的估算方法,但以精确的准确的估算值为准。我们在精确的估算中展示了准确的准确的计算方法。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

负性共刺激分子B7-H3与c-Met结合调控EMT促进结直肠癌的转移及机制

国家自然科学基金

0+阅读 · 2015年12月31日

靶向EphB4的放射性分子探针在体诊治评价

国家自然科学基金

0+阅读 · 2014年12月31日

基于蛋白组学技术筛选卵巢癌早期诊断生物学标记物的前瞻性研究

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

化坚解毒活血法调节p53-microRNA200/HIF-1a发挥抗大肠癌转移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

IMPDH为靶点的小分子抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常在颌骨骨化纤维瘤发病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

lncRNA-uc001ylu在缺氧诱导EMT促进胃癌转移中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向智能电网的中压电力线通信关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

Bayesian Calibration of Imperfect Computer Models using Physics-Informed Priors

Arxiv

0+阅读 · 2023年1月31日

Improving Monte Carlo Evaluation with Offline Data

Arxiv

0+阅读 · 2023年1月31日

Can Persistent Homology provide an efficient alternative for Evaluation of Knowledge Graph Completion Methods?

Arxiv

0+阅读 · 2023年1月31日

Variational Latent Branching Model for Off-Policy Evaluation

Arxiv

0+阅读 · 2023年1月31日

Measuring robustness of dynamical systems. Relating time and space to length and precision

Arxiv

0+阅读 · 2023年1月30日

Fast Exact Leverage Score Sampling from Khatri-Rao Products with Applications to Tensor Decomposition

Arxiv

0+阅读 · 2023年1月29日

GFlowNets and variational inference

Arxiv

0+阅读 · 2023年1月29日

MetaStackVis: Visually-Assisted Performance Evaluation of Metamodels

Arxiv

0+阅读 · 2023年1月28日

G-formula for causal inference via multiple imputation

Arxiv

0+阅读 · 2023年1月27日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

VIP会员

文章信息

相关主题

预测准确率

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

相关论文

Bayesian Calibration of Imperfect Computer Models using Physics-Informed Priors

Arxiv

0+阅读 · 2023年1月31日

Improving Monte Carlo Evaluation with Offline Data

Arxiv

0+阅读 · 2023年1月31日

Can Persistent Homology provide an efficient alternative for Evaluation of Knowledge Graph Completion Methods?

Arxiv

0+阅读 · 2023年1月31日

Variational Latent Branching Model for Off-Policy Evaluation

Arxiv

0+阅读 · 2023年1月31日

Measuring robustness of dynamical systems. Relating time and space to length and precision

Arxiv

0+阅读 · 2023年1月30日

Fast Exact Leverage Score Sampling from Khatri-Rao Products with Applications to Tensor Decomposition

Arxiv

0+阅读 · 2023年1月29日

GFlowNets and variational inference

Arxiv

0+阅读 · 2023年1月29日

MetaStackVis: Visually-Assisted Performance Evaluation of Metamodels

Arxiv

0+阅读 · 2023年1月28日

G-formula for causal inference via multiple imputation

Arxiv

0+阅读 · 2023年1月27日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

相关基金

负性共刺激分子B7-H3与c-Met结合调控EMT促进结直肠癌的转移及机制

国家自然科学基金

0+阅读 · 2015年12月31日

靶向EphB4的放射性分子探针在体诊治评价

国家自然科学基金

0+阅读 · 2014年12月31日

基于蛋白组学技术筛选卵巢癌早期诊断生物学标记物的前瞻性研究

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

化坚解毒活血法调节p53-microRNA200/HIF-1a发挥抗大肠癌转移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

IMPDH为靶点的小分子抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常在颌骨骨化纤维瘤发病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

lncRNA-uc001ylu在缺氧诱导EMT促进胃癌转移中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向智能电网的中压电力线通信关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员