Statistical inference for association studies in the presence of binary outcome misclassification

March 17, 2023


Kimberly A. Hochstedler, Martin T. Wells

From arXiv, 58 pages, 5 figures

In biomedical and public health association studies, binary outcome variables may be subject to misclassification, resulting in substantial bias in effect estimates. The feasibility of addressing binary outcome misclassification in regression models is often hindered by model identifiability issues. In this paper, we characterize the identifiability problems in this class of models as a specific case of "label switching" and leverage a pattern in the resulting parameter estimates to solve the permutation invariance of the complete data log-likelihood. Our proposed algorithm in binary outcome misclassification models does not require gold standard labels and relies only on the assumption that outcomes are correctly classified at least 50% of the time. A label switching correction is applied within estimation methods to recover unbiased effect estimates and to estimate misclassification rates in cases with one or more sequential observed outcomes. Open source software is provided to implement the proposed methods for single- and two-stage models. We give a detailed simulation study for our proposed methodology and apply these methods to data for single-stage modeling of the Medical Expenditure Panel Survey (MEPS) from 2020 and two-stage modeling of data from the Virginia Department of Criminal Justice Services.
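The label switching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' software. It assumes a single covariate, a logistic model for the true outcome, and constant sensitivity and specificity, and it reads "correctly classified at least 50% of the time" as the average of sensitivity and specificity being at least 0.5. All function names and numeric values are hypothetical.

```python
import numpy as np

def expit(z):
    """Inverse logit."""
    return 1.0 / (1.0 + np.exp(-z))

def observed_prob(x, beta, sens, spec):
    """P(Y* = 1 | x) when the true outcome follows a logistic model.

    sens = P(Y* = 1 | Y = 1), spec = P(Y* = 0 | Y = 0); both are assumed
    constant here, which is a simplification made for this sketch.
    """
    p_true = expit(beta[0] + beta[1] * x)
    return sens * p_true + (1.0 - spec) * (1.0 - p_true)

def switch_labels(beta, sens, spec):
    """Relabel the latent outcome (Y -> 1 - Y).

    The observed-data fit is unchanged, but the regression coefficients flip
    sign and the classification rates are mirrored.
    """
    return -np.asarray(beta, dtype=float), 1.0 - spec, 1.0 - sens

def correct_label_switching(beta, sens, spec):
    """Keep the labelling with average correct classification of at least 0.5."""
    if (sens + spec) / 2.0 < 0.5:
        return switch_labels(beta, sens, spec)
    return np.asarray(beta, dtype=float), sens, spec

# Hypothetical estimates that landed on the "switched" labelling.
x = np.linspace(-2.0, 2.0, 5)
beta_hat, sens_hat, spec_hat = np.array([0.5, -1.2]), 0.15, 0.10

beta_fix, sens_fix, spec_fix = correct_label_switching(beta_hat, sens_hat, spec_hat)

# Both parameter sets imply the same distribution for the observed outcome.
same_fit = np.allclose(
    observed_prob(x, beta_hat, sens_hat, spec_hat),
    observed_prob(x, beta_fix, sens_fix, spec_fix),
)
print(beta_fix, sens_fix, spec_fix, same_fit)
```

Because both parameter sets produce identical probabilities for the observed outcome, the data alone cannot tell them apart. The 50% assumption is what selects one labelling, and with it the sign of the effect estimates.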

