The inner representation of deep neural networks (DNNs) is indecipherable, which makes it difficult to tune DNN models, control their training process, and interpret their outputs. In this paper, we propose a novel approach to investigate the inner representation of DNNs through topological data analysis (TDA). Persistent homology (PH), one of the outstanding methods in TDA, was employed for investigating the complexities of trained DNNs. We constructed clique complexes on trained DNNs and calculated the one-dimensional PH of DNNs. The PH reveals the combinational effects of multiple neurons in DNNs at different resolutions, which is difficult to be captured without using PH. Evaluations were conducted using fully connected networks (FCNs) and networks combining FCNs and convolutional neural networks (CNNs) trained on the MNIST and CIFAR-10 data sets. Evaluation results demonstrate that the PH of DNNs reflects both the excess of neurons and problem difficulty, making PH one of the prominent methods for investigating the inner representation of DNNs.
翻译:深神经网络的内部代表性是无法理解的,这使得难以调和DNN模型、控制其培训过程和解释其产出。在本文件中,我们提出一种新的方法,通过地形数据分析(TDA)来调查DNN的内在代表性。持久性同质学是TDA的突出方法之一,用于调查受过训练的DNN的复杂性。我们用受过训练的DNN网络建造了小组综合体,并计算了DNN的单维PH。PH揭示了DN多个神经元在不同决议中的组合效应,如果不使用PH,就很难捕捉到这种效应。评价是利用完全连接的网络(FCNs)和将FCNs和CARFAR-10数据集相结合的网络进行的。评价结果表明,DNN的PH反映了神经的过剩和问题的困难,使PH成为调查DNPN公司内部代表性的突出方法之一。