Adaptive greedy forward variable selection for linear regression models with incomplete data using multiple imputation - 专知 Papers

Linear regression · Linear · MoDELS · Greedy · Forward

20 October 2022


Yong-Shiuan Lee

from arxiv, 34 pages, 9 figures

Variable selection is crucial for sparse modeling in the age of big data. Missing values are common in data and make variable selection more complicated. Multiple imputation (MI) produces multiply imputed datasets for the missing values and has been widely applied in various variable selection procedures. However, performing variable selection directly on the whole MI data or on bootstrapped MI data may not be worth the computational cost. To quickly identify the active variables in the linear regression model, we propose the adaptive grafting procedure with three pooling rules on MI data. The proposed methods proceed iteratively: they start by finding the active variables based on the complete-case subset and then expand the working data matrix in both the number of active variables and the number of available observations. A comprehensive simulation study demonstrates the selection accuracy, in several respects, and the computational efficiency of the proposed methods. Two real-life examples illustrate the strength of the proposed methods.
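To make the first stage described in the abstract more concrete, here is a minimal sketch of a gradient-based greedy forward selection run on the complete-case subset only. It is written in the spirit of grafting and is not the paper's procedure: the function name `greedy_forward_complete_case`, the stopping threshold `tol`, and the simulated data are all assumptions made for illustration, and the pooling rules over MI datasets and the expansion of the working data matrix are not shown.

```python
import numpy as np

def greedy_forward_complete_case(X, y, tol=0.1, max_vars=None):
    """Illustrative grafting-style forward selection (hypothetical helper).

    Uses only rows of X without missing values. At each step it adds the
    inactive column with the largest absolute gradient of the squared-error
    loss, |x_j' r| / n, refits least squares on the active set, and stops
    once no inactive gradient exceeds `tol`.
    """
    complete = ~np.isnan(X).any(axis=1)      # complete-case row mask
    Xc, yc = X[complete], y[complete]
    n, p = Xc.shape
    max_vars = p if max_vars is None else max_vars

    active = []                              # indices of selected columns
    coef = np.zeros(0)
    residual = yc.copy()
    while len(active) < max_vars:
        grad = np.abs(Xc.T @ residual) / n   # gradient magnitude per column
        if active:
            grad[active] = 0.0               # never re-select active columns
        j = int(np.argmax(grad))
        if grad[j] <= tol:                   # no remaining strong candidate
            break
        active.append(j)
        coef, *_ = np.linalg.lstsq(Xc[:, active], yc, rcond=None)
        residual = yc - Xc[:, active] @ coef
    return active, coef

# Simulated example (assumed setup): columns 1 and 4 are the active variables.
rng = np.random.default_rng(0)
n, p = 400, 10
X = rng.normal(size=(n, p))
y = 3.0 * X[:, 1] - 2.0 * X[:, 4] + rng.normal(scale=0.5, size=n)

# Delete about 20% of the entries in an inactive column to create incomplete rows.
X_missing = X.copy()
X_missing[rng.random(n) < 0.2, 7] = np.nan

active, coef = greedy_forward_complete_case(X_missing, y, tol=0.25)
```

The threshold plays the role of a penalty level: a larger `tol` admits fewer variables. In the paper's setting the selected set would then be refined as imputed observations are brought in, which this sketch deliberately leaves out.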

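Related content

Linear regression

Linear regression is a statistical analysis method that uses regression analysis from mathematical statistics to determine the quantitative dependence between two or more variables, and it is very widely used. It is expressed as y = w'x + e, where e is an error term that follows a normal distribution with mean 0.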
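As a small illustration of the model y = w'x + e, the sketch below simulates data with normally distributed, mean-zero errors and recovers w by ordinary least squares. The function name `simulate_and_fit`, the coefficient values, and the noise level are assumptions chosen for the example.

```python
import numpy as np

def simulate_and_fit(n=500, w_true=(2.0, -1.0, 0.5), noise_sd=0.1, seed=0):
    """Simulate y = w'x + e with e ~ N(0, noise_sd^2) and estimate w by least squares."""
    rng = np.random.default_rng(seed)
    w_true = np.asarray(w_true, dtype=float)
    X = rng.normal(size=(n, w_true.size))    # one row per observation x
    e = rng.normal(scale=noise_sd, size=n)   # mean-zero normal errors
    y = X @ w_true + e
    w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w_true, w_hat

w_true, w_hat = simulate_and_fit()
```

With a few hundred observations and small noise, `w_hat` should land close to `w_true`; the gap shrinks as `n` grows or `noise_sd` falls.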
