Diffractive optical neural networks (DONNs) have attracted lots of attention as they bring significant advantages in terms of power efficiency, parallelism, and computational speed compared with conventional deep neural networks (DNNs), which have intrinsic limitations when implemented on digital platforms. However, inversely mapping algorithm-trained physical model parameters onto real-world optical devices with discrete values is a non-trivial task as existing optical devices have non-unified discrete levels and non-monotonic properties. This work proposes a novel device-to-system hardware-software codesign framework, which enables efficient physics-aware training of DONNs w.r.t arbitrary experimental measured optical devices across layers. Specifically, Gumbel-Softmax is employed to enable differentiable discrete mapping from real-world device parameters into the forward function of DONNs, where the physical parameters in DONNs can be trained by simply minimizing the loss function of the ML task. The results have demonstrated that our proposed framework offers significant advantages over conventional quantization-based methods, especially with low-precision optical devices. Finally, the proposed algorithm is fully verified with physical experimental optical systems in low-precision settings.
翻译:diffactive光学神经网络(DONNS)吸引了大量关注,因为它们在电效、平行和计算速度方面带来了与传统的深神经网络(DNNS)相比的巨大优势,这些网络在数字平台上实施时具有内在的局限性,然而,将经过算法训练的物理模型参数映射到具有离散值的实界光学设备上,是一项非三重性的任务,因为现有的光学设备具有非统一离散水平和非热质特性。这项工作提出了一个新的设备到系统的硬件软件代码标志框架,使DONNs(W.r.)能够对各层的任意实验测量光学设备进行有效的物理认知培训。具体地说,Gumbel-Softmax被用来帮助从现实世界设备参数中进行不同的离散映到DONNs前方功能,通过将ML任务的损失功能最小化来培训DONNs的物理参数。结果表明,我们提议的框架为常规的裁分法方法,特别是低精度光学光学设备,提供了显著的优势。最后,拟议的物理测算法与低光学物理光学环境完全核查。