过度对称的可转移性:负感应器的例子 (Tractability from overparametrization: The example of the negative perceptron) - 专知论文

会员服务 ·

0

感知机 · 易处理的 · 阈值 · 向量化 · 线性的 ·

2021 年 10 月 28 日

Tractability from overparametrization: The example of the negative perceptron

翻译：过度对称的可转移性:负感应器的例子

Andrea Montanari,Yiqiao Zhong,Kangjie Zhou

from arxiv, 88 pages; 7 pdf figures

In the negative perceptron problem we are given $n$ data points $({\boldsymbol x}_i,y_i)$, where ${\boldsymbol x}_i$ is a $d$-dimensional vector and $y_i\in\{+1,-1\}$ is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible \emph{negative} margin. In other words, we want to find a unit norm vector ${\boldsymbol \theta}$ that maximizes $\min_{i\le n}y_i\langle {\boldsymbol \theta},{\boldsymbol x}_i\rangle$. This is a non-convex optimization problem (it is equivalent to finding a maximum norm vector in a polytope), and we study its typical properties under two random models for the data. We consider the proportional asymptotics in which $n,d\to \infty$ with $n/d\to\delta$, and prove upper and lower bounds on the maximum margin $\kappa_{\text{s}}(\delta)$ or -- equivalently -- on its inverse function $\delta_{\text{s}}(\kappa)$. In other words, $\delta_{\text{s}}(\kappa)$ is the overparametrization threshold: for $n/d\le \delta_{\text{s}}(\kappa)-\varepsilon$ a classifier achieving vanishing training error exists with high probability, while for $n/d\ge \delta_{\text{s}}(\kappa)+\varepsilon$ it does not. Our bounds on $\delta_{\text{s}}(\kappa)$ match to the leading order as $\kappa\to -\infty$. We then analyze a linear programming algorithm to find a solution, and characterize the corresponding threshold $\delta_{\text{lin}}(\kappa)$. We observe a gap between the interpolation threshold $\delta_{\text{s}}(\kappa)$ and the linear programming threshold $\delta_{\text{lin}}(\kappa)$, raising the question of the behavior of other algorithms.

翻译：在负倍感问题中, 我们得到的是$( { boldsylmbol x ⁇ i, y_ i) 的数据点 $ ({ boldsymbol x ⁇ i$ 是一个美元维度矢量, $y_ i\ in\\ ⁇ 1, 1\ 美元是一个二进制标签。数据不是线性可分解的, 因此我们满足于找到一个具有最大可能\ emph{ negy} 差价的线性分类器。换句话说, 我们想要找到一个单位规范矢量 $( boldsyol x% i_ i_ i_ i_ i) $( boldymall xball xball_ a blookyal_ listal_ listal_ listal_ laxxxxxxx laxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

0

相关内容

感知机

感知机在机器学习中，感知机是一种二进制分类器监督学习的算法。二值分类器是一个函数，它可以决定输入是否属于某个特定的类，输入由一个数字向量表示。它是一种线性分类器，即基于线性预测函数结合一组权值和特征向量进行预测的分类算法。

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Word Embedding List｜ACL 2020 词嵌入长文汇总及分类

Word Embedding List｜ACL 2020 词嵌入长文汇总及分类

PaperWeekly

3+阅读 · 2020年5月30日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

TensorFlow 2.0 分布式训练

TensorFlow 2.0 分布式训练

TensorFlow

8+阅读 · 2020年1月19日

word2Vec总结

AINLP

3+阅读 · 2019年11月2日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

[DLdigest-8] 每日一道算法

[DLdigest-8] 每日一道算法

深度学习每日摘要

4+阅读 · 2017年11月2日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient

Arxiv

0+阅读 · 2021年12月29日

A New Method of Construction of Permutation Trinomials with Coefficients 1

Arxiv

0+阅读 · 2021年12月29日

Parametric and nonparametric probability distribution estimators of sample maximum

Arxiv

0+阅读 · 2021年12月29日

Bias for the Trace of the Resolvent and Its Application on Non-Gaussian and Non-centered MIMO Channels

Arxiv

0+阅读 · 2021年12月28日

Improving Nonparametric Classification via Local Radial Regression with an Application to Stock Prediction

Arxiv

0+阅读 · 2021年12月28日

Unbiased Parameter Inference for a Class of Partially Observed Lévy-Process Models

Arxiv

0+阅读 · 2021年12月27日

Network Inference and Influence Maximization from Samples

Arxiv

7+阅读 · 2021年6月7日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Reducing Parameter Space for Neural Network Training

Arxiv

3+阅读 · 2018年8月17日

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Arxiv

3+阅读 · 2018年8月2日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能驾驶：旧理念与新技术

美军手册：战术心理战分遣队与小组指南 | 68页

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

美国防部自主系统研制试验与鉴定指南 | 2025年最新200页

相关资讯

Word Embedding List｜ACL 2020 词嵌入长文汇总及分类

Word Embedding List｜ACL 2020 词嵌入长文汇总及分类

PaperWeekly

3+阅读 · 2020年5月30日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

TensorFlow 2.0 分布式训练

TensorFlow 2.0 分布式训练

TensorFlow

8+阅读 · 2020年1月19日

word2Vec总结

AINLP

3+阅读 · 2019年11月2日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

[DLdigest-8] 每日一道算法

[DLdigest-8] 每日一道算法

深度学习每日摘要

4+阅读 · 2017年11月2日

【学习】(Python)SVM数据分类

【学习】(Python)SVM数据分类

机器学习研究会

6+阅读 · 2017年10月15日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

相关论文

Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient

Arxiv

0+阅读 · 2021年12月29日

A New Method of Construction of Permutation Trinomials with Coefficients 1

Arxiv

0+阅读 · 2021年12月29日

Parametric and nonparametric probability distribution estimators of sample maximum

Arxiv

0+阅读 · 2021年12月29日

Bias for the Trace of the Resolvent and Its Application on Non-Gaussian and Non-centered MIMO Channels

Arxiv

0+阅读 · 2021年12月28日

Improving Nonparametric Classification via Local Radial Regression with an Application to Stock Prediction

Arxiv

0+阅读 · 2021年12月28日

Unbiased Parameter Inference for a Class of Partially Observed Lévy-Process Models

Arxiv

0+阅读 · 2021年12月27日

Network Inference and Influence Maximization from Samples

Arxiv

7+阅读 · 2021年6月7日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Reducing Parameter Space for Neural Network Training

Arxiv

3+阅读 · 2018年8月17日

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Arxiv

3+阅读 · 2018年8月2日

微信扫码咨询专知VIP会员