Spiking Neural Networks (SNNs) are promising energy-efficient AI models when implemented on neuromorphic hardware. However, training SNNs efficiently is challenging due to their non-differentiability. Most existing methods either suffer from high latency (i.e., long simulation time steps) or cannot achieve performance as high as Artificial Neural Networks (ANNs). In this paper, we propose the Differentiation on Spike Representation (DSR) method, which achieves performance competitive with ANNs at low latency. First, we encode spike trains into spike representations using (weighted) firing rate coding. Based on the spike representation, we systematically derive that the spiking dynamics of common neuron models can be represented as a sub-differentiable mapping. With this viewpoint, our proposed DSR method trains SNNs through gradients of this mapping and avoids the common non-differentiability problem in SNN training. We then analyze the error introduced when the forward computation of the SNN represents this mapping. To reduce this error, we propose training the spike threshold in each layer and introducing a new hyperparameter into the neuron models. With these components, the DSR method achieves state-of-the-art SNN performance with low latency on both static and neuromorphic datasets, including CIFAR-10, CIFAR-100, ImageNet, and DVS-CIFAR10.
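The abstract outlines the core mechanism: run the actual spiking dynamics in the forward pass, but backpropagate through the sub-differentiable mapping that the spike representation approximately follows. Below is a minimal PyTorch sketch of this idea, not the authors' implementation; it assumes IF neurons with soft reset and a constant input current per step, and the names DSRSpike, T, and V_TH are illustrative placeholders.

```python
import torch

T = 20       # number of simulation time steps (illustrative choice)
V_TH = 1.0   # spike threshold; the paper proposes training it per layer

class DSRSpike(torch.autograd.Function):
    """Forward: run integrate-and-fire (IF) dynamics for T steps and
    return the scaled firing-rate representation of the spike train.
    Backward: differentiate the sub-differentiable mapping
    a_out = clamp(a_in, 0, V_TH) that the dynamics approximate,
    instead of the non-differentiable spike function itself."""

    @staticmethod
    def forward(ctx, x):
        # x: input-current representation per neuron, e.g. W @ a_prev
        v = torch.zeros_like(x)            # membrane potential
        spike_count = torch.zeros_like(x)  # accumulated spikes
        for _ in range(T):
            v = v + x                      # integrate the input current
            spike = (v >= V_TH).float()    # fire on threshold crossing
            v = v - spike * V_TH           # soft reset by subtraction
            spike_count = spike_count + spike
        ctx.save_for_backward(x)
        return V_TH * spike_count / T      # firing-rate representation

    @staticmethod
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        # Gradient of clamp(x, 0, V_TH): identity inside (0, V_TH), zero outside.
        return grad_out * ((x > 0) & (x < V_TH)).float()

# Usage: gradients flow through the representation, not through spikes.
x = torch.randn(4, 8, requires_grad=True)
loss = DSRSpike.apply(x).sum()
loss.backward()
```

In a full network, x would be the linear transform of the previous layer's spike representation, V_TH would be a trainable per-layer parameter as the abstract proposes, and LIF neurons would use exponentially weighted firing rates in place of the plain average used here.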