Optical neural networks are emerging as a promising type of machine learning hardware capable of energy-efficient, parallel computation. Today's optical neural networks are mainly developed to perform optical inference after in silico training on digital simulators. However, various physical imperfections that cannot be accurately modelled may lead to the notorious reality gap between the digital simulator and the physical system. To address this challenge, we demonstrate hybrid training of optical neural networks where the weight matrix is trained with neuron activation functions computed optically via forward propagation through the network. We examine the efficacy of hybrid training with three different networks: an optical linear classifier, a hybrid opto-electronic network, and a complex-valued optical network. We perform a comparative study to in silico training, and our results show that hybrid training is robust against different kinds of static noise. Our platform-agnostic hybrid training scheme can be applied to a wide variety of optical neural networks, and this work paves the way towards advanced all-optical training in machine intelligence.
翻译:光学神经网络正在成为一个充满希望的机器学习硬件类型,能够进行节能、平行的计算。今天的光学神经网络主要是为了在数字模拟器的硅培训后进行光学推断。然而,无法准确模拟的各种物理缺陷可能导致数字模拟器和物理系统之间臭名昭著的现实差距。为了应对这一挑战,我们展示了光学神经网络的混合培训,其重量矩阵通过通过网络的远端传播以光学方式计算神经活化功能。我们审视了与三个不同网络的混合培训的功效:光学线性分类器、混合光学电子网络和复杂价值的光学网络。我们进行了在硅培训方面的比较研究,我们的结果显示,混合培训对不同类型的静态噪音是强有力的。我们的平台-敏感混合培训计划可以应用于广泛的光学神经网络,这项工作为先进的机器智能全光学培训铺平了道路。