关于二进制数据回归模型中可变选择方法的审查和建议 (A review and recommendations on variable selection methods in regression models for binary data) - 专知论文

会员服务 ·

0

对数几率回归 · binary · Extensibility · 频率主义学派 · 预测准确率 ·

2022 年 5 月 16 日

A review and recommendations on variable selection methods in regression models for binary data

翻译：关于二进制数据回归模型中可变选择方法的审查和建议

Souvik Bag,Kapil Gupta,Soudeep Deb

The selection of essential variables in logistic regression is vital because of its extensive use in medical studies, finance, economics and related fields. In this paper, we explore four main typologies (test-based, penalty-based, screening-based, and tree-based) of frequentist variable selection methods in logistic regression setup. Primary objective of this work is to give a comprehensive overview of the existing literature for practitioners. Underlying assumptions and theory, along with the specifics of their implementations, are detailed as well. Next, we conduct a thorough simulation study to explore the performances of fifteen different methods in terms of variable selection, estimation of coefficients, prediction accuracy as well as time complexity under various settings. We take low, moderate and high dimensional setups and consider different correlation structures for the covariates. A real-life application, using a high-dimensional gene expression data, is also included in this study to further understand the efficacy and consistency of the methods. Finally, based on our findings in the simulated data and in the real data, we provide recommendations for practitioners on the choice of variable selection methods under various contexts.

翻译：选择后勤回归中的基本变量至关重要,因为它广泛用于医学研究、金融、经济学和相关领域。在本文件中,我们探讨了物流回归设置中常见变量选择方法的四种主要类型(测试、惩罚、筛选和树本),这项工作的主要目的是为从业人员全面概述现有文献。基础假设和理论及其实施的具体细节也得到了详细阐述。接着,我们进行了彻底的模拟研究,以探讨不同情况下在变量选择、系数估计、预测准确性和时间复杂性方面的十五种不同方法的性能。我们采用了低、中度和高维设置,并考虑了共变体的不同关联结构。这项研究中还包括了一种真实生活应用,使用高维的基因表达数据,以进一步理解这些方法的功效和一致性。最后,根据我们在模拟数据和真实数据中的调查结果,我们向从业人员提供在不同情况下选择不同选择方法的建议。

0

相关内容

对数几率回归

对数几率回归

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

动力学涨落对网络结构的影响

国家自然科学基金

0+阅读 · 2015年12月31日

SnS、SnSe、SnSxSe1-x纳米材料的可控制备与高压研究

国家自然科学基金

0+阅读 · 2015年12月31日

节理岩体摩擦律的动力学机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

LOC283683-NIPA1-BMPRII途径对胆固醇平衡和动脉粥样硬化的影响及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

日本囊对虾Leurelectin识别弧菌鞭毛蛋白的机制和功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

沥青混合料力链时空演化的细观分析

国家自然科学基金

1+阅读 · 2013年12月31日

实时原位检测细胞表面RTK受体-生长因子配体分子识别行为的新方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

离子液体为溶剂热解制备金属颗粒机理及其电化学行为研究

国家自然科学基金

0+阅读 · 2011年12月31日

A Probabilistic State Space Model for Joint Inference from Differential Equations and Data

Arxiv

0+阅读 · 2022年7月5日

The impact of clustering binary data on relative risk towards a study of inferential methods

Arxiv

0+阅读 · 2022年7月4日

Model-Free 3D Shape Control of Deformable Objects Using Novel Features Based on Modal Analysis

Arxiv

0+阅读 · 2022年7月4日

Stability Approach to Regularization Selection for Reduced-Rank Regression

Arxiv

0+阅读 · 2022年7月3日

Double soft-thresholded model for multi-group scalar on vector-valued image regression

Arxiv

0+阅读 · 2022年7月2日

The closest vector problem and the zero-temperature p-spin landscape for lossy compression

Arxiv

0+阅读 · 2022年7月1日

Flexible Bayesian Product Mixture Models for Vector Autoregressions

Flexible Bayesian Product Mixture Models for Vector Autoregressions

Arxiv

0+阅读 · 2022年7月1日

Conditional Variable Selection for Intelligent Test

Arxiv

0+阅读 · 2022年7月1日

Variational Inference for Additive Main and Multiplicative Interaction Effects Models

Arxiv

0+阅读 · 2022年6月29日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

VIP会员

文章信息

相关主题

对数几率回归

频率主义学派

预测准确率

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《2024年度美国防部作战测试与评估报告》500页

《面相未来作战空中系统中有人-无人编组的AI驱动协作模式选择》含slides

无人机编队飞行：复杂环境中作战的策略、挑战与应用

《探索军事背景下共享大语言模型：AI助手与智能体部署中可扩展性与效率的早期洞察》（含44页slides）

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

A Probabilistic State Space Model for Joint Inference from Differential Equations and Data

Arxiv

0+阅读 · 2022年7月5日

The impact of clustering binary data on relative risk towards a study of inferential methods

Arxiv

0+阅读 · 2022年7月4日

Model-Free 3D Shape Control of Deformable Objects Using Novel Features Based on Modal Analysis

Arxiv

0+阅读 · 2022年7月4日

Stability Approach to Regularization Selection for Reduced-Rank Regression

Arxiv

0+阅读 · 2022年7月3日

Double soft-thresholded model for multi-group scalar on vector-valued image regression

Arxiv

0+阅读 · 2022年7月2日

The closest vector problem and the zero-temperature p-spin landscape for lossy compression

Arxiv

0+阅读 · 2022年7月1日

Flexible Bayesian Product Mixture Models for Vector Autoregressions

Flexible Bayesian Product Mixture Models for Vector Autoregressions

Arxiv

0+阅读 · 2022年7月1日

Conditional Variable Selection for Intelligent Test

Arxiv

0+阅读 · 2022年7月1日

Variational Inference for Additive Main and Multiplicative Interaction Effects Models

Arxiv

0+阅读 · 2022年6月29日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

相关基金

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

动力学涨落对网络结构的影响

国家自然科学基金

0+阅读 · 2015年12月31日

SnS、SnSe、SnSxSe1-x纳米材料的可控制备与高压研究

国家自然科学基金

0+阅读 · 2015年12月31日

节理岩体摩擦律的动力学机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

LOC283683-NIPA1-BMPRII途径对胆固醇平衡和动脉粥样硬化的影响及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

日本囊对虾Leurelectin识别弧菌鞭毛蛋白的机制和功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

沥青混合料力链时空演化的细观分析

国家自然科学基金

1+阅读 · 2013年12月31日

实时原位检测细胞表面RTK受体-生长因子配体分子识别行为的新方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

离子液体为溶剂热解制备金属颗粒机理及其电化学行为研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员