Threshold activation functions are highly preferable in neural networks due to their efficiency in hardware implementations. Moreover, their mode of operation is more interpretable and resembles that of biological neurons. However, traditional gradient-based algorithms such as gradient descent cannot be used to train the parameters of neural networks with threshold activations, since the activation function has zero gradient everywhere except at a single non-differentiable point. To this end, we study weight-decay regularized training problems for deep neural networks with threshold activations. We first show that the regularized deep threshold network training problem can be equivalently formulated as a standard convex optimization problem, which parallels the LASSO method, provided that the width of the last hidden layer exceeds a certain threshold. We also derive a simplified convex optimization formulation for the case where the dataset can be shattered at a certain layer of the network. We corroborate our theoretical results with various numerical experiments.
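To make the zero-gradient obstruction concrete, the threshold (unit step) activation can be written as
\[
\sigma(x) = \mathbb{1}\{x \ge 0\} = \begin{cases} 1, & x \ge 0,\\ 0, & x < 0, \end{cases}
\qquad \frac{d\sigma}{dx}(x) = 0 \quad \text{for all } x \neq 0,
\]
so every backpropagated gradient vanishes and gradient descent receives no learning signal. For comparison, the convex program alluded to above parallels the standard LASSO problem; as an illustrative sketch only (the dictionary matrix $A$, labels $y$, and regularization weight $\beta$ are generic placeholders here, not the paper's exact construction),
\[
\min_{z} \; \frac{1}{2} \left\| A z - y \right\|_2^2 + \beta \left\| z \right\|_1,
\]
where the $\ell_1$ penalty mirrors the sparsity-inducing effect of weight decay on the network parameters.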