In this paper we provide a constructive estimate of the convergence rate for training a well-known class of neural networks: multi-class logistic regression. Despite several decades of successful use, our rigorous results appear to be new, reflecting the gap between the practice and theory of machine learning. Training a neural network is typically done via variations of the gradient descent method. If a minimum of the loss function exists and gradient descent is used as the training method, we provide an expression that relates the learning rate to the rate of convergence to that minimum. The method involves an estimate of the condition number of the Hessian of the loss function. We also discuss the existence of a minimum, as it is not automatic that one exists. One method of ensuring convergence is to assign positive probability to every class in the training dataset.
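For context, the following is a minimal sketch of the standard relationship between the learning rate and the convergence rate of gradient descent on a loss that is smooth and strongly convex near its minimizer; the constants $\mu$, $L$, and the condition number $\kappa = L/\mu$ are illustrative assumptions and are not the exact quantities derived in the paper.
\[
  \theta_{t+1} = \theta_t - \eta\, \nabla \ell(\theta_t), \qquad \eta = \frac{1}{L},
\]
\[
  \ell(\theta_t) - \ell(\theta^*) \;\le\; \Bigl(1 - \tfrac{1}{\kappa}\Bigr)^{t} \bigl(\ell(\theta_0) - \ell(\theta^*)\bigr),
\]
where $\ell$ is the loss, $\theta^*$ its minimizer, $\mu$ and $L$ bound the smallest and largest eigenvalues of the Hessian on the relevant region, and $\kappa$ is the condition number whose estimation drives the bound. The paper's contribution is an estimate of this kind specialized to the multi-class logistic (cross-entropy) loss.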