Deep learning is the backbone of artificial intelligence technologies, and it can be regarded as a kind of multilayer feedforward neural network. An essence of deep learning is information propagation through layers. This suggests that there is a connection between deep neural networks and dynamical systems in the sense that information propagation is explicitly modeled by the time-evolution of dynamical systems. In this study, we perform pattern recognition based on the optimal control of continuous-time dynamical systems, which is suitable for physical hardware implementation. The learning is based on the adjoint method to optimally control dynamical systems, and the deep (virtual) network structures based on the time evolution of the systems are used for processing input information. As a key example, we apply the dynamics-based recognition approach to an optoelectronic delay system and demonstrate that the use of the delay system allows for image recognition and nonlinear classifications using only a few control signals. This is in contrast to conventional multilayer neural networks, which require a large number of weight parameters to be trained. The proposed approach provides insight into the mechanisms of deep network processing in the framework of an optimal control problem and presents a pathway for realizing physical computing hardware.
翻译:深层学习是人工智能技术的支柱,可以被视为一种多层向神经网络的多层进料。深层学习的精髓之一是通过层层传播信息。这表明深神经网络和动态系统之间有一种联系,即信息传播是由动态系统的时间演进所明确建模的。在这项研究中,我们根据适合物理硬件实施的连续时间动态系统的最佳控制来进行模式识别。学习的基础是优化控制动态系统的联合方法,以及基于系统时间演变的深(虚拟)网络结构用于处理输入信息。我们把基于动态的识别方法应用到一个光电子化延迟系统,并证明延迟系统的使用允许仅使用少量控制信号来识别图像和非线性分类。这与传统的多层神经网络形成对照,后者需要大量重参数才能得到培训。拟议的方法为在最佳控制问题框架内的深度网络处理机制提供了深入的洞察力,并提出了实现物理计算硬件的途径。