Spiking Neural Networks are often touted as brain-inspired learning models for the third wave of Artificial Intelligence. Although recent SNNs trained with supervised backpropagation show classification accuracy comparable to deep networks, the performance of unsupervised learning-based SNNs remains much lower. This paper presents a heterogeneous recurrent spiking neural network (HRSNN) with unsupervised learning for spatio-temporal classification of video activity recognition tasks on RGB (KTH, UCF11, UCF101) and event-based datasets (DVS128 Gesture). The key novelty of the HRSNN is that the recurrent layer in HRSNN consists of heterogeneous neurons with varying firing/relaxation dynamics, and they are trained via heterogeneous spike-time-dependent-plasticity (STDP) with varying learning dynamics for each synapse. We show that this novel combination of heterogeneity in architecture and learning method outperforms current homogeneous spiking neural networks. We further show that HRSNN can achieve similar performance to state-of-the-art backpropagation trained supervised SNN, but with less computation (fewer neurons and sparse connection) and less training data.
翻译:虽然最近接受过监督的SNN(SNN)的分类精确度与深层网络相当,但未经监督的基于学习的SNN(SNN)的性能仍然低得多。本文展示了一种杂交的重复性神经网络(HRSNN),对RGB(KTH、UCF11、UCF101)和以事件为基础的数据集(DVS128 Gesture)的视频活动识别任务进行了不受监督的分类。HRSNN(DVS128 Gesture)的关键新颖之处是,HRSNNN(H)的经常层由具有不同点火/放松动态的多异性神经元组成,它们通过多变性的超时依赖型神经网络(STDP(STDP)培训,每个神经神经网络的学习动态各不相同。我们显示,在建筑和学习方法中的异性能性能新组合超越了当前的同质神经网络(DVS128 GESNN)能够实现类似的性运行,但经过较少监管的神经-SROD(S-ROD-ROD)的后制数据连接(Sfornal-st-prapropalation)。