Domain-specific neural network accelerators have garnered attention because of their improved energy efficiency and inference performance compared to CPUs and GPUs. Such accelerators are thus well suited for resource-constrained embedded systems. However, mapping sophisticated neural network models onto these accelerators still entails significant energy and memory consumption, along with high inference time overhead. Binarized neural networks (BNNs), which utilize single-bit weights, represent an efficient way to implement and deploy neural network models on accelerators. In this paper, we present a novel optical-domain BNN accelerator, named ROBIN, which intelligently integrates heterogeneous microring resonator optical devices with complementary capabilities to efficiently implement the key functionalities in BNNs. We perform detailed fabrication-process variation analyses at the optical device level, explore efficient corrective tuning for these devices, and integrate circuit-level optimization to counter thermal variations. As a result, our proposed ROBIN architecture is robust, energy-efficient, low-latency, and high-throughput when executing BNN models. Our analysis shows that ROBIN can outperform the best-known optical BNN accelerators and also many electronic accelerators. Specifically, our energy-efficient ROBIN design exhibits energy-per-bit values that are ~4x lower than electronic BNN accelerators and ~933x lower than a recently proposed photonic BNN accelerator, while a performance-efficient ROBIN design shows ~3x and ~25x better performance than electronic and photonic BNN accelerators, respectively.
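As a rough illustration of why single-bit weights are cheap to process, the sketch below binarizes a small weight vector and computes a dot product two ways: directly, and with the XNOR-and-popcount identity often used in BNN hardware. It is a generic software sketch, not ROBIN's optical datapath. The function names, the sample values, the choice to map zero to +1, and the use of binary activations are all assumptions made for this example.

```python
import numpy as np

def binarize(w):
    """Map real-valued weights to {-1, +1}; zero maps to +1 (assumed convention)."""
    return np.where(w >= 0, 1, -1).astype(np.int8)

def xnor_popcount_dot(a_bits, w_bits):
    """Dot product of two {-1, +1} vectors stored as {0, 1} bits.

    XNOR is 1 wherever the bits agree, so with `matches` agreeing positions
    out of n, the signed dot product equals 2 * matches - n.
    """
    n = a_bits.size
    matches = int(np.count_nonzero(a_bits == w_bits))
    return 2 * matches - n

# Hypothetical sample data, chosen only for illustration.
weights = np.array([0.7, -0.2, 0.0, -1.3])
activations = np.array([1, -1, -1, 1], dtype=np.int8)

w_bin = binarize(weights)
reference = int(np.dot(activations.astype(np.int32), w_bin.astype(np.int32)))

# Encode +1 as bit 1 and -1 as bit 0, then use the bitwise formulation.
a_bits = (activations > 0).astype(np.uint8)
w_bits = (w_bin > 0).astype(np.uint8)
bitwise = xnor_popcount_dot(a_bits, w_bits)

print(w_bin.tolist(), reference, bitwise)
```

Both formulations give the same result, which is why a multiply-accumulate can be reduced to a bitwise comparison and a count once the weights (and, in this sketch, the activations) are single-bit.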