The number of parameters in deep neural networks (DNNs) is scaling at about 5$\times$ the rate of Moore's Law. To sustain this growth, photonic computing is a promising avenue, as it enables higher throughput in dominant general matrix-matrix multiplication (GEMM) operations in DNNs than their electrical counterpart. However, purely photonic systems face several challenges including lack of photonic memory and accumulation of noise. In this paper, we present an electro-photonic accelerator, ADEPT, which leverages a photonic computing unit for performing GEMM operations, a vectorized digital electronic ASIC for performing non-GEMM operations, and SRAM arrays for storing DNN parameters and activations. In contrast to prior works in photonic DNN accelerators, we adopt a system-level perspective and show that the gains while large are tempered relative to prior expectations. Our goal is to encourage architects to explore photonic technology in a more pragmatic way considering the system as a whole to understand its general applicability in accelerating today's DNNs. Our evaluation shows that ADEPT can provide, on average, 5.73$\times$ higher throughput per Watt compared to the traditional systolic arrays (SAs) in a full-system, and at least 6.8$\times$ and $2.5\times$ better throughput per Watt, compared to state-of-the-art electronic and photonic accelerators, respectively.
翻译:深神经网络( DNNS) 的参数数量正在以摩尔法律的速率以约5美元计。 为了维持这一增长,光度计算是一个充满希望的途径,因为它使得DNS中占主导地位的通用矩阵矩阵矩阵倍增(GEMM)操作的传输量高于其电源。然而,纯光度系统面临若干挑战,包括缺乏光度记忆和噪音累积。在本文中,我们展示了一个电子-光速加速器ADEPT,它利用一个光度计算单位来进行GEMM操作,一个用于进行非GEMM操作的矢量数字电子ASIC,以及用于存储 DNM参数和激活的SRAM阵列。与以前在DNNS光度加速器中的工作相比,我们采用了系统层面的观点,表明虽然大得比以往预期要慢。我们的目标是鼓励建筑师以更务实的方式探索光度技术,将该系统作为一个整体来理解其在今天加速 DNPNPS的运行中的通用应用性。我们的评估显示,在平均、最高汇率和最高汇率上,从最高汇率到最高汇率到最高汇率。