In this paper, it is demonstrated through a case study that multilayer feedforward neural networks with ReLU activation functions can in principle be trained iteratively with Mixed Integer Linear Programs (MILPs), as follows. Weights are determined by batch learning, with multiple iterations per batch of training data. In each iteration, the algorithm starts at the output layer and propagates information back to the first hidden layer, adjusting the weights by solving MILPs or Linear Programs. For each layer, the goal is to minimize the difference between its output and the corresponding target output. The target output of the last (output) layer is equal to the ground truth; the target output of a preceding layer is defined as the adjusted input of the following layer. For a given layer, the weights are computed by solving a MILP. Then, except for the first hidden layer, the input values are also modified with a MILP so that the layer outputs better match their target outputs. The method was tested and compared with Tensorflow/Keras (Adam optimizer) using two simple networks on the MNIST dataset of handwritten digits. The achieved accuracies were of the same magnitude as those obtained with Tensorflow/Keras.
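To make the loop structure concrete, the following is a minimal Python sketch of the backward, layer-wise training pass described above. It is an illustration under stated assumptions, not the paper's implementation: the helpers solve_weight_milp and solve_input_milp are hypothetical placeholders (filled here with least-squares fits) standing in for the actual MILP/LP formulations, and the layer shapes, ReLU-everywhere forward pass, and iteration count are likewise assumptions.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def solve_weight_milp(layer_input, target_output, W):
    # Placeholder for the per-layer MILP that adjusts the weights so that
    # relu(layer_input @ W) approaches target_output. (Assumption: a simple
    # least-squares fit stands in for the exact MILP formulation; the real
    # MILP would also account for the current weights W.)
    W_new, *_ = np.linalg.lstsq(layer_input, target_output, rcond=None)
    return W_new

def solve_input_milp(layer_input, target_output, W):
    # Placeholder for the MILP that adjusts the layer's *input* values; the
    # adjusted input becomes the target output of the preceding layer.
    adjusted_T, *_ = np.linalg.lstsq(W.T, target_output.T, rcond=None)
    return adjusted_T.T

def train_batch(weights, batch_x, batch_y, iterations=3):
    """One batch of the iterative, layer-wise training pass sketched above."""
    for _ in range(iterations):
        # Forward pass to record the input of every layer.
        inputs = [batch_x]
        for W in weights:
            inputs.append(relu(inputs[-1] @ W))
        # Backward pass: the target of the last layer is the ground truth.
        target = batch_y
        for l in range(len(weights) - 1, -1, -1):
            weights[l] = solve_weight_milp(inputs[l], target, weights[l])
            if l > 0:  # the first hidden layer's input (the data) stays fixed
                target = solve_input_milp(inputs[l], target, weights[l])
    return weights

if __name__ == "__main__":
    # Toy usage with assumed MNIST-like shapes: 784 inputs, 32 hidden units, 10 classes.
    rng = np.random.default_rng(0)
    weights = [0.01 * rng.normal(size=(784, 32)), 0.01 * rng.normal(size=(32, 10))]
    x = rng.normal(size=(64, 784))
    y = np.eye(10)[rng.integers(0, 10, size=64)]
    weights = train_batch(weights, x, y)
```

The sketch only mirrors the control flow (forward pass, then backward layer-by-layer updates of weights and targets); replacing the least-squares placeholders with the MILP/LP formulations from the paper is what distinguishes the proposed method.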