Support vector machines and linear regression coincide with very high-dimensional features - 专知论文

Support vectors · Vectorization · Support vector machines · Linear · MoDELS

May 28, 2021

Support vector machines and linear regression coincide with very high-dimensional features

Navid Ardeshir,Clayton Sanford,Daniel Hsu

From arXiv, 32 pages, 9 figures

The support vector machine (SVM) and minimum Euclidean norm least squares regression are two fundamentally different approaches to fitting linear models, but they have recently been connected in models for very high-dimensional data through a phenomenon of support vector proliferation, where every training example used to fit an SVM becomes a support vector. In this paper, we explore the generality of this phenomenon and make the following contributions. First, we prove a super-linear lower bound on the dimension (in terms of sample size) required for support vector proliferation in independent feature models, matching the upper bounds from previous works. We further identify a sharp phase transition in Gaussian feature models, bound the width of this transition, and give experimental support for its universality. Finally, we hypothesize that this phase transition occurs only in much higher-dimensional settings in the $\ell_1$ variant of the SVM, and we present a new geometric characterization of the problem that may elucidate this phenomenon for the general $\ell_p$ case.
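To make "support vector proliferation" concrete, here is a minimal sketch in Python with NumPy. It is not taken from the paper; it assumes a hard-margin SVM with no bias term, labels in {-1, +1}, and an invertible Gram matrix K = X Xᵀ. Under those assumptions, the unconstrained optimum of the SVM dual is α_i = y_i (K⁻¹ y)_i, so every training example is a support vector (and the SVM solution equals the minimum Euclidean norm interpolator Xᵀ K⁻¹ y) when all of these values are strictly positive. The function names, the Gaussian feature model, and the random labels are illustrative choices.

```python
import numpy as np


def proliferation_holds(X, y):
    """Check whether every training example is a support vector.

    Assumes a homogeneous (no-bias) hard-margin SVM, labels in {-1, +1},
    and an invertible Gram matrix K = X X^T (typical when d >= n).
    """
    K = X @ X.T                      # n x n Gram matrix
    beta = np.linalg.solve(K, y)     # K^{-1} y, coefficients of the min-norm interpolator
    return bool(np.all(y * beta > 0))


def min_norm_interpolator(X, y):
    """Minimum Euclidean norm solution w of X w = y."""
    return X.T @ np.linalg.solve(X @ X.T, y)


def fraction_with_proliferation(n, d, trials=20, seed=0):
    """Fraction of random trials (Gaussian features, random labels,
    both illustrative assumptions) in which proliferation holds."""
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(trials):
        X = rng.standard_normal((n, d))
        y = rng.choice([-1.0, 1.0], size=n)
        hits += proliferation_holds(X, y)
    return hits / trials


if __name__ == "__main__":
    n = 20
    for d in (20, 50, 100, 200, 400, 800):
        print(d, fraction_with_proliferation(n, d))
```

Sweeping `d` for a fixed `n` gives a rough empirical view of the transition the abstract describes; the exact location and width of that transition are results of the paper and are not reproduced by this sketch.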
