The edge-of-chaos dynamics of wide randomly initialized low-rank feedforward networks are analyzed. Formulae for the optimal weight and bias variances are extended from the full-rank to low-rank setting and are shown to follow from multiplicative scaling. The principle second order effect, the variance of the input-output Jacobian, is derived and shown to increase as the rank to width ratio decreases. These results inform practitioners how to randomly initialize feedforward networks with a reduced number of learnable parameters while in the same ambient dimension, allowing reductions in the computational cost and memory constraints of the associated network.
翻译:分析随机随机初始化低级饲料向前网络的热点边缘动态。最佳重量和偏差公式从全位向低位延伸,并显示从倍增效应后。原则的第二级效应,即输入-输出 Jacobian 的差异,从中得出,并显示随着排位到宽度比例的下降而增加。这些结果为实践者如何随机初始化具有较少可学习参数的向前网络,同时在相同的环境层面,从而可以减少相关网络的计算成本和记忆限制。