具有宽度的高效内核可变变量选择 (Efficient kernel-based variable selection with sparsistency) - 专知论文

会员服务 ·

0

核化 · 再生核希尔伯特空间 · PARCO · 线性的 · 高斯核 ·

2021 年 2 月 3 日

Efficient kernel-based variable selection with sparsistency

翻译：具有宽度的高效内核可变变量选择

Xin He,Junhui Wang,Shaogao Lv

from arxiv, 27 pages, 5 figures

Variable selection is central to high-dimensional data analysis, and various algorithms have been developed. Ideally, a variable selection algorithm shall be flexible, scalable, and with theoretical guarantee, yet most existing algorithms cannot attain these properties at the same time. In this article, a three-step variable selection algorithm is developed, involving kernel-based estimation of the regression function and its gradient functions as well as a hard thresholding. Its key advantage is that it assumes no explicit model assumption, admits general predictor effects, allows for scalable computation, and attains desirable asymptotic sparsistency. The proposed algorithm can be adapted to any reproducing kernel Hilbert space (RKHS) with different kernel functions, and can be extended to interaction selection with slight modification. Its computational cost is only linear in the data dimension, and can be further improved through parallel computing. The sparsistency of the proposed algorithm is established for general RKHS under mild conditions, including linear and Gaussian kernels as special cases. Its effectiveness is also supported by a variety of simulated and real examples.

翻译：变量选择是高维数据分析的核心,并且已经开发了各种算法。理想的情况是,变量选择算法应该是灵活、可缩放的,并且有理论保证,但大多数现有算法不能同时实现这些属性。在本条中,开发了三步变量选择算法,包括对回归函数及其梯度函数以及硬阈值进行内核估计。它的关键优势在于它没有假设明确的模型假设,没有接受一般预测效应,允许可缩放的计算,并达到理想的零散状态。拟议的算法可以被调整为具有不同内核功能的任何再生产内核希尔伯特空间(RKHS),也可以扩展为互动选择而稍作修改。它的计算成本在数据方面只是线性,并且可以通过平行计算得到进一步的改进。所拟议的算法的广度是在温和的条件下为一般的RKHS系统设定的,包括线形和高阶内核作为特例。其有效性还得到了各种模拟和真实的例子的支持。

0

相关内容

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

56+阅读 · 2020年11月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

79+阅读 · 2020年7月26日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

121+阅读 · 2020年5月30日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

35+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Monte Carlo algorithm for the extrema of tempered stable processes

Arxiv

0+阅读 · 2021年3月29日

Light Euclidean Steiner Spanners in the Plane

Arxiv

0+阅读 · 2021年3月28日

Graph Convolutional Networks for Model-Based Learning in Nonlinear Inverse Problems

Arxiv

0+阅读 · 2021年3月28日

Consensus-Based Optimization on the Sphere: Convergence to Global Minimizers and Machine Learning

Arxiv

0+阅读 · 2021年3月26日

Variable Selection Using Nearest Neighbor Gaussian Processes

Arxiv

0+阅读 · 2021年3月26日

Bootstrapping Persistent Betti Numbers and Other Stabilizing Statistics

Arxiv

0+阅读 · 2021年3月26日

Logarithmic law of large random correlation matrix

Arxiv

0+阅读 · 2021年3月25日

Minimax Semiparametric Learning With Approximate Sparsity

Arxiv

0+阅读 · 2021年3月25日

Small Sample Spaces for Gaussian Processes

Arxiv

0+阅读 · 2021年3月24日

An Iterative Spanning Forest Framework for Superpixel Segmentation

Arxiv

9+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

再生核希尔伯特空间

相关VIP内容

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

56+阅读 · 2020年11月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

79+阅读 · 2020年7月26日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

121+阅读 · 2020年5月30日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

35+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

热门VIP内容

开通专知VIP会员享更多权益服务

《“龙式无人机”——军事行动中的铝热剂无人机系统》47页

中文版 | 美陆军与空军通过"2025项目融合"协同重塑未来军事指挥控制体系

中文版 | 算法战场：人工智能、国家安全与不断演变的威胁格局

《美国国防部网络作战测试与评估指南手册》最新40页

相关资讯

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Monte Carlo algorithm for the extrema of tempered stable processes

Arxiv

0+阅读 · 2021年3月29日

Light Euclidean Steiner Spanners in the Plane

Arxiv

0+阅读 · 2021年3月28日

Graph Convolutional Networks for Model-Based Learning in Nonlinear Inverse Problems

Arxiv

0+阅读 · 2021年3月28日

Consensus-Based Optimization on the Sphere: Convergence to Global Minimizers and Machine Learning

Arxiv

0+阅读 · 2021年3月26日

Variable Selection Using Nearest Neighbor Gaussian Processes

Arxiv

0+阅读 · 2021年3月26日

Bootstrapping Persistent Betti Numbers and Other Stabilizing Statistics

Arxiv

0+阅读 · 2021年3月26日

Logarithmic law of large random correlation matrix

Arxiv

0+阅读 · 2021年3月25日

Minimax Semiparametric Learning With Approximate Sparsity

Arxiv

0+阅读 · 2021年3月25日

Small Sample Spaces for Gaussian Processes

Arxiv

0+阅读 · 2021年3月24日

An Iterative Spanning Forest Framework for Superpixel Segmentation

Arxiv

9+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员