通过双加速实现快速边边距最大化 (Fast Margin Maximization via Dual Acceleration) - 专知论文

会员服务 ·

0

边缘化 · 对率损失 · FAST · 线性分类 · Performer ·

2021 年 8 月 22 日

Fast Margin Maximization via Dual Acceleration

翻译：通过双加速实现快速边边距最大化

Ziwei Ji,Nathan Srebro,Matus Telgarsky

from arxiv, ICML 2021

We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (e.g., the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of $\widetilde{\mathcal{O}}(1/t^2)$. This contrasts with a rate of $\mathcal{O}(1/\log(t))$ for standard gradient descent, and $\mathcal{O}(1/t)$ for normalized gradient descent. This momentum-based method is derived via the convex dual of the maximum-margin problem, and specifically by applying Nesterov acceleration to this dual, which manages to result in a simple and intuitive method in the primal. This dual view can also be used to derive a stochastic variant, which performs adaptive non-uniform sampling via the dual variables.

翻译：我们提出并分析一种基于动力的梯度方法,用于培训具有指数尾数损失(如指数或后勤损失)的线性分类员,该方法使可分离数据的分类幅度最大化,以$\ 宽度{O}{(1/t ⁇ 2)$的速率最大化。这与标准梯度下降的速率$mathcal{O}(1/\log(t))美元和正常梯度下降的$$\mathcal{O}(1/t)美元形成对照。这一基于动力的方法通过最大海拔问题的二次曲线,特别是将Nesterov加速到这一双重数据中,这导致在原始值中采用简单和直观的方法。这种双重观点也可以用来产生一种随机可变的变量,通过双重变量进行适应的非统一取样。

0

相关内容

边缘化

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Google最新《机器学习对偶性》报告，48页ppt

Google最新《机器学习对偶性》报告，48页ppt

专知会员服务

36+阅读 · 2020年11月29日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【论文】Awesome Relation Classification Paper（关系分类）（PART I）

【论文】Awesome Relation Classification Paper（关系分类）（PART I）

AINLP

5+阅读 · 2019年8月8日

已删除

将门创投

8+阅读 · 2019年7月10日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Halpern-Type Accelerated and Splitting Algorithms For Monotone Inclusions

Arxiv

0+阅读 · 2021年10月15日

Reward-Weighted Regression Converges to a Global Optimum

Reward-Weighted Regression Converges to a Global Optimum

Arxiv

0+阅读 · 2021年10月15日

The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program

Arxiv

0+阅读 · 2021年10月13日

Pairwise Margin Maximization for Deep Neural Networks

Arxiv

0+阅读 · 2021年10月9日

RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging

Arxiv

0+阅读 · 2021年9月30日

An Optimal Control Framework for Joint-channel Parallel MRI Reconstruction without Coil Sensitivities

Arxiv

0+阅读 · 2021年9月20日

Fast and Accurate Optimization of Metasurfaces with Gradient Descent and the Woodbury Matrix Identity

Arxiv

0+阅读 · 2021年7月7日

Towards Understanding Acceleration Tradeoff between Momentum and Asynchrony in Nonconvex Stochastic Optimization

Arxiv

3+阅读 · 2018年10月1日

Accelerated Reinforcement Learning

Arxiv

6+阅读 · 2018年4月24日

Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations

Arxiv

9+阅读 · 2018年4月22日

VIP会员

文章信息

相关主题

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Google最新《机器学习对偶性》报告，48页ppt

Google最新《机器学习对偶性》报告，48页ppt

专知会员服务

36+阅读 · 2020年11月29日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

【论文】Awesome Relation Classification Paper（关系分类）（PART I）

【论文】Awesome Relation Classification Paper（关系分类）（PART I）

AINLP

5+阅读 · 2019年8月8日

已删除

将门创投

8+阅读 · 2019年7月10日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Halpern-Type Accelerated and Splitting Algorithms For Monotone Inclusions

Arxiv

0+阅读 · 2021年10月15日

Reward-Weighted Regression Converges to a Global Optimum

Reward-Weighted Regression Converges to a Global Optimum

Arxiv

0+阅读 · 2021年10月15日

The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program

Arxiv

0+阅读 · 2021年10月13日

Pairwise Margin Maximization for Deep Neural Networks

Arxiv

0+阅读 · 2021年10月9日

RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging

Arxiv

0+阅读 · 2021年9月30日

An Optimal Control Framework for Joint-channel Parallel MRI Reconstruction without Coil Sensitivities

Arxiv

0+阅读 · 2021年9月20日

Fast and Accurate Optimization of Metasurfaces with Gradient Descent and the Woodbury Matrix Identity

Arxiv

0+阅读 · 2021年7月7日

Towards Understanding Acceleration Tradeoff between Momentum and Asynchrony in Nonconvex Stochastic Optimization

Arxiv

3+阅读 · 2018年10月1日

Accelerated Reinforcement Learning

Arxiv

6+阅读 · 2018年4月24日

Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations

Arxiv

9+阅读 · 2018年4月22日

微信扫码咨询专知VIP会员