通过溶液方法分析单隐藏-激光神经网络 (Analysis of One-Hidden-Layer Neural Networks via the Resolvent Method)

We compute the asymptotic empirical spectral distribution of a non-linear random matrix model by using the resolvent method. Motivated by random neural networks, we consider the random matrix $M = Y Y^\ast$ with $Y = f(WX)$, where $W$ and $X$ are random rectangular matrices with i.i.d. centred entries and $f$ is a non-linear smooth function which is applied entry-wise. We prove that the Stieltjes transform of the limiting spectral distribution satisfies a quartic self-consistent equation up to some error terms, which is exactly the equation obtained by [Pennington, Worah] and [Benigni, P\'{e}ch\'{e}] with the moment method approach. In addition, we extend the previous results to the case of additive bias $Y=f(WX+B)$ with $B$ being an independent rank-one Gaussian random matrix, closer modelling the neural network infrastructures encountering in practice. Our approach following the \emph{resolvent method} is more robust than the moment method and is expected to provide insights also for models where the combinatorics of the latter become intractable.

翻译：我们用固态方法计算非线性随机矩阵模型的非线性实验光谱分布。在随机神经网络的驱动下, 我们考虑随机矩阵 $M = Y Y ast$ = Y Y = f( WX)$, 其中W$ 和 $X$是随机的矩形矩阵, 使用 i. d. 中心条目和 $f$ 是一个非线性平滑功能, 使用输入。我们证明, 限制光谱分布的 Stieltjes 转换满足了某些错误条件的等式, 达到某些错误条件, 这正是[ Pennington, Worrah] 和 [ Benigni, P\\ { e}\ ch\ {e} 获得的等式, 即时尚方法。此外, 我们将先前的结果推广到添加偏差 $Y=f( WX+B) 的情况, 是一个独立的一等式随机矩阵, 更接近于实践中遇到的神经网络基础设施的模拟。我们采用的方法, 其中的时空洞察法也更可靠。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/