In the application of neural networks, we need to select a suitable model based on the problem complexity and the dataset scale. To analyze the network's capacity, quantifying the information learned by the network is necessary. This paper proves that the distance between the neural network weights in different training stages can be used to estimate the information accumulated by the network in the training process directly. The experiment results verify the utility of this method. An application of this method related to the label corruption is shown at the end.
翻译:在应用神经网络时,我们需要根据问题的复杂性和数据集的规模选择一个合适的模型。为了分析网络的能力,有必要量化网络所学到的信息。本文件证明,在不同培训阶段神经网络重量之间的距离可用于直接估计网络在培训过程中积累的信息。实验结果验证了这种方法的效用。这一方法的应用与标签腐败有关,在结尾处展示了该方法的应用情况。