Monolithic Silicon Photonic Architecture for Training Deep Neural Networks with Direct Feedback Alignment

The field of artificial intelligence (AI) has witnessed tremendous growth in recent years. However, some of the most pressing challenges for the continued development of AI systems are the fundamental bandwidth, energy efficiency, and speed limitations of electronic computer architectures. There has been growing interest in using photonic processors to perform neural network inference, but these networks are currently trained using standard digital electronics. Here, we propose on-chip training of neural networks enabled by a CMOS-compatible silicon photonic architecture that harnesses the potential for massively parallel, efficient, and fast data operations. Our scheme employs the direct feedback alignment training algorithm, which trains neural networks using error feedback rather than error backpropagation, and can operate at speeds of trillions of multiply-accumulate (MAC) operations per second while consuming less than one picojoule per MAC operation. The photonic architecture exploits parallelized matrix-vector multiplications, using arrays of microring resonators to process multi-channel analog signals along single waveguide buses, to calculate the gradient vector of each neural network layer in situ. This gradient calculation is the most computationally expensive operation performed during the backward pass. We also experimentally demonstrate training a deep neural network on the MNIST dataset using on-chip MAC operation results. Our approach to efficient, ultra-fast neural network training showcases photonics as a promising platform for executing AI applications.
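To make the training algorithm named above concrete, here is a minimal software sketch of direct feedback alignment in NumPy. It is not the photonic implementation and is not taken from the paper: the network size, the toy XOR data, the learning rate, and every name (`train_dfa`, `B1`, `B2`, and so on) are assumptions chosen for illustration. The point it shows is that each hidden layer receives the output error through its own fixed random feedback matrix, so the per-layer gradient vector reduces to one matrix-vector product followed by an elementwise multiply, which is the kind of operation the abstract assigns to the microring resonator arrays.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def train_dfa(X, Y, hidden=16, epochs=2000, lr=0.1, seed=0):
    """Train a small tanh network with direct feedback alignment (DFA).

    Illustrative sketch only: sizes and hyperparameters are assumptions.
    Returns the list of per-epoch binary cross-entropy losses.
    """
    rng = np.random.default_rng(seed)
    n, n_in = X.shape
    n_out = Y.shape[1]

    # Forward weights and biases (these are trained).
    W1 = rng.normal(0.0, 0.5, (n_in, hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, (hidden, hidden))
    b2 = np.zeros(hidden)
    W3 = rng.normal(0.0, 0.1, (hidden, n_out))
    b3 = np.zeros(n_out)

    # Fixed random feedback matrices, one per hidden layer (never updated).
    B1 = rng.normal(0.0, 0.5, (n_out, hidden))
    B2 = rng.normal(0.0, 0.5, (n_out, hidden))

    losses = []
    for _ in range(epochs):
        # Forward pass.
        h1 = np.tanh(X @ W1 + b1)
        h2 = np.tanh(h1 @ W2 + b2)
        y = sigmoid(h2 @ W3 + b3)

        eps = 1e-12  # avoids log(0)
        loss = -np.mean(Y * np.log(y + eps) + (1 - Y) * np.log(1 - y + eps))
        losses.append(float(loss))

        # Output error (gradient of the loss w.r.t. the output pre-activation).
        e = y - Y

        # DFA: project the output error straight to each hidden layer with
        # its fixed random matrix, instead of backpropagating through W3, W2.
        d2 = (e @ B2) * (1.0 - h2 ** 2)
        d1 = (e @ B1) * (1.0 - h1 ** 2)

        # Gradient-descent style weight updates using the DFA signals.
        W3 -= lr * h2.T @ e / n
        b3 -= lr * e.mean(axis=0)
        W2 -= lr * h1.T @ d2 / n
        b2 -= lr * d2.mean(axis=0)
        W1 -= lr * X.T @ d1 / n
        b1 -= lr * d1.mean(axis=0)

    return losses


# Toy usage on XOR (an assumed example, unrelated to the MNIST experiment).
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)
losses = train_dfa(X, Y)
print(f"loss: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

Because `d1` and `d2` depend only on the output error `e` and the fixed matrices, the hidden-layer updates do not have to wait on one another, which is what makes the backward pass amenable to parallel hardware.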

