Deep learning has led to unprecedented successes in solving some very difficult problems in domains such as computer vision, natural language processing, and general pattern recognition. These achievements are the culmination of decades-long research into better training techniques and deeper neural network models, as well as improvements in hardware platforms that are used to train and execute the deep neural network models. Many application-specific integrated circuit (ASIC) hardware accelerators for deep learning have garnered interest in recent years due to their improved performance and energy-efficiency over conventional CPU and GPU architectures. However, these accelerators are constrained by fundamental bottlenecks due to 1) the slowdown in CMOS scaling, which has limited computational and performance-per-watt capabilities of emerging electronic processors, and 2) the use of metallic interconnects for data movement, which do not scale well and are a major cause of bandwidth, latency, and energy inefficiencies in almost every contemporary processor. Silicon photonics has emerged as a promising CMOS-compatible alternative to realize a new generation of deep learning accelerators that can use light for both communication and computation. This article surveys the landscape of silicon photonics to accelerate deep learning, with a coverage of developments across design abstractions in a bottom-up manner, to convey both the capabilities and limitations of the silicon photonics paradigm in the context of deep learning acceleration.
翻译:深层学习导致在解决计算机视觉、自然语言处理和一般模式认知等领域一些非常困难的问题方面取得了前所未有的成功,这些成就是数十年来对更好的培训技术和更深的神经网络模型进行的研究,以及用于培训和实施深神经网络模型的硬件平台的改进,许多具体应用的综合电路(ASIC)硬件加速器近年来由于在常规CPU和GPU结构上提高了性能和能源效率而引起了兴趣。然而,这些加速器受到基本瓶颈的制约,其原因包括:(1) CMOS的缩放速度放慢,使新兴电子处理器的计算和性能能力都有限,以及(2) 数据移动使用金属相互连接,这种连接的规模不高,是几乎每个现代处理器中带宽、拉宽和能源效率高的主要原因。 硅光控器已成为一个很有希望的加速器,可以实现新一代的深层次学习加速器的加速,可以在深度的通信和计算能力中使用光学的光学和深层模型,从而加速了深度的学习,从而加速了在深度的模型设计上的发展。