关口不是你们需要的区域NN国需要的 (Gates are not what you need in RNNs)

Recurrent neural networks have flourished in many areas. Consequently, we can see new RNN cells being developed continuously, usually by creating or using gates in a new, original way. But what if we told you that gates in RNNs are redundant? In this paper, we propose a new recurrent cell called Residual Recurrent Unit (RRU) which beats traditional cells and does not employ a single gate. It is based on the residual shortcut connection together with linear transformations, ReLU, and normalization. To evaluate our cell's effectiveness, we compare its performance against the widely-used GRU and LSTM cells and the recently proposed Mogrifier LSTM on several tasks including, polyphonic music modeling, language modeling, and sentiment analysis. Our experiments show that RRU outperforms the traditional gated units on most of these tasks. Also, it has better robustness to parameter selection, allowing immediate application in new tasks without much tuning. We have implemented the RRU in TensorFlow, and the code is made available at https://github.com/LUMII-Syslab/RRU .

翻译：经常性神经网络在许多领域蓬勃发展。因此, 我们可以看到新的RNN细胞正在不断发展, 通常是通过以新的、原创的方式创建或使用大门。但如果我们告诉你, RNN 的大门是多余的吗? 在本文中, 我们提议建立一个名为残余常务单元(RRU)的新的重复式细胞(RRU), 它可以击败传统细胞, 并且不使用单一的大门。它基于与线性变、 ReLU 和正常化的剩余捷径连接。为了评估我们细胞的效能, 我们将其性能与广泛使用的 GRU 和 LSTM 细胞以及最近提议的Morphorizer LSTM 的性能进行比较, 包括多功能音乐模型、语言模型和情绪分析。我们的实验显示, RRU 超越了大部分这些任务的传统常务单元。另外, 它对于参数的选择比较有力, 允许在不进行大量调控线性任务中立即应用。我们在TensorFlow 中实施了 RRU, 我们实施了 RU, 并在 http://github.com/ LU- Sylab/ RRU/ RRU/ RRURRU/ RRU/ RRU/ RRRU) 上提供该代码。

相关内容

长短期记忆网络

关注 120

长短期记忆网络(LSTM)是一种用于深度学习领域的人工回归神经网络(RNN)结构。与标准的前馈神经网络不同，LSTM具有反馈连接。它不仅可以处理单个数据点(如图像)，还可以处理整个数据序列(如语音或视频)。例如，LSTM适用于未分段、连接的手写识别、语音识别、网络流量或IDSs(入侵检测系统)中的异常检测等任务。

因果知识图谱自然语言理解

专知会员服务

81+阅读 · 2021年7月3日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

【AAAI2021】记忆门控循环网络

专知会员服务

50+阅读 · 2020年12月28日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日