Recurrent neural networks are widely used in speech and language processing. Due to their dependence on past inputs, standard algorithms for training these models, such as back-propagation through time (BPTT), cannot be efficiently parallelised. Furthermore, applying these models to structures more complex than sequences requires inference-time approximations, which introduce inconsistency between inference and training. This paper shows that recurrent neural networks can be reformulated as fixed-points of non-linear equation systems. These fixed-points can be computed exactly using an iterative algorithm that requires as many iterations as the length of any given sequence. Each iteration of this algorithm adds one additional Markovian-like order of dependencies, such that upon termination all dependencies modelled by the recurrent neural network have been incorporated. Although exact fixed-points inherit the same parallelisation and inconsistency issues, this paper shows that approximate fixed-points can be computed in parallel and used consistently in training and inference, including tasks such as lattice rescoring. Experimental validation is performed on two tasks, Penn Treebank and WikiText-2, and shows that approximate fixed-points yield prediction performance competitive with recurrent neural networks trained using the BPTT algorithm.
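To make the fixed-point view concrete, below is a minimal sketch (not the paper's implementation) for a simple Elman-style RNN. The hidden states H = (h_1, ..., h_T) satisfy the non-linear system h_t = tanh(W h_{t-1} + U x_t + b), i.e. H = F(H). Iterating H <- F(H) from H = 0 recovers the sequential hidden states exactly after T sweeps, because each sweep propagates dependencies one additional step back, while stopping after K < T sweeps gives an approximate fixed-point whose per-sweep updates are fully parallel across time steps. The parameter names (W, U, b), the tanh non-linearity, and all sizes are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def fixed_point_rnn(X, W, U, b, num_sweeps):
    """X: (T, input_dim) inputs; returns (T, hidden_dim) hidden states after num_sweeps."""
    T = X.shape[0]
    hidden_dim = W.shape[0]
    H = np.zeros((T, hidden_dim))
    for _ in range(num_sweeps):
        # Every time step is updated from the previous sweep's states, so all T
        # updates within a sweep are independent and could run in parallel.
        H_prev = np.vstack([np.zeros((1, hidden_dim)), H[:-1]])
        H = np.tanh(H_prev @ W.T + X @ U.T + b)
    return H

def sequential_rnn(X, W, U, b):
    """Standard sequential recurrence, for comparison with the exact fixed-point."""
    T = X.shape[0]
    h = np.zeros(W.shape[0])
    H = np.zeros((T, W.shape[0]))
    for t in range(T):
        h = np.tanh(W @ h + U @ X[t] + b)
        H[t] = h
    return H

rng = np.random.default_rng(0)
T, input_dim, hidden_dim = 12, 4, 8
X = rng.normal(size=(T, input_dim))
W = rng.normal(scale=0.3, size=(hidden_dim, hidden_dim))
U = rng.normal(scale=0.3, size=(hidden_dim, input_dim))
b = np.zeros(hidden_dim)

exact = fixed_point_rnn(X, W, U, b, num_sweeps=T)   # exact after T sweeps
approx = fixed_point_rnn(X, W, U, b, num_sweeps=3)  # truncated-dependency approximation
print(np.max(np.abs(exact - sequential_rnn(X, W, U, b))))   # ~0: exact agreement
print(np.max(np.abs(approx - sequential_rnn(X, W, U, b))))  # non-zero approximation gap
```

In this sketch, the number of sweeps plays the role of the Markovian-like order of dependencies described above: sweep k makes h_t correct for all t <= k, so T sweeps reproduce the sequential recurrence exactly.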