Hand-crafting effective and efficient structures for recurrent neural networks (RNNs) is a difficult, expensive, and time-consuming process. To address this challenge, we propose a novel neuro-evolution algorithm based on ant colony optimization (ACO), called ant swarm neuro-evolution (ASNE), for directly optimizing RNN topologies. The procedure selects from multiple modern recurrent cell types such as Delta-RNN, GRU, LSTM, MGU and UGRNN cells, as well as recurrent connections which may span multiple layers and/or steps of time. In order to introduce an inductive bias that encourages the formation of sparser synaptic connectivity patterns, we investigate several variations of the core algorithm. We do so primarily by formulating different functions that drive the underlying pheromone simulation process (which mimic L1 and L2 regularization in standard machine learning) as well as by introducing ant agents with specialized roles (inspired by how real ant colonies operate), i.e., explorer ants that construct the initial feed forward structure and social ants which select nodes from the feed forward connections to subsequently craft recurrent memory structures. We also incorporate a Lamarckian strategy for weight initialization which reduces the number of backpropagation epochs required to locally train candidate RNNs, speeding up the neuro-evolution process. Our results demonstrate that the sparser RNNs evolved by ASNE significantly outperform traditional one and two layer architectures consisting of modern memory cells, as well as the well-known NEAT algorithm. Furthermore, we improve upon prior state-of-the-art results on the time series dataset utilized in our experiments.
翻译:经常神经网络( RNN) 的手工艺有效且高效的结构是一个困难、昂贵且耗时的过程。 为了应对这一挑战,我们提议基于蚁群优化(ACO)的新型神经革命算法(ACO),称为蚂蚁群神经革命(ASNE),直接优化RNN的地形。该程序从多种现代经常性细胞类型(如Delta-RNNN、GRU、LSTM、MGU和UGNN)以及可能跨越多个层次和/或步骤的经常性连接中挑选出一种高效的高效结构。为了引入鼓励形成稀疏合成合成合成连接模式的诱导偏差,我们建议基于蚂蚁蚁群优化(ACO)的新神经革命算法(ACO),称为Ant swarm神经进化(ASNNNNE),以及引入具有特殊作用的蚂蚁剂(受真实蚂蚁作用的驱使), 也就是说, 探索分子在建立初始前层和社交细胞结构时,我们从前的节选择前端连接的节点, AS- IM IM IM AS IM IM IM IM IM 结构, IM IM 结构的初始化结构将一个快速化的精化, 递化的精化过程, 递化的精化, 以显示的精化的精化的精化过程的精化的精化的精化的精化的精化的精化过程。