An important issue in dealing with Deep Convolutional Neural Networks (DCNN) is the 'black box problem', which represents the unknowns about internal information representation and processing, especially in the middle and higher layers. In this study, we adopted a systems neuroscience methodology to measure the visual feature selectivity and visualize the spatial receptive field of the units in VGG16. Orientation and spatial frequency tunings of each unit were measured using sinusoidal grating stimuli. The image category selectivity of each unit was also measured using natural image stimuli. The spatial structures of the receptive fields of all convolutional units were estimated by activation-weighted average (AWA) and activation-weighted covariance (AWC) analyses. In the middle layers (convolutional layers in block3 and block4), AWC analysis successfully reconstructed the receptive field that predicted the visual feature selectivity of the unit. Those results suggested the possibility that analyzing the reconstructed receptive field structure can be used to interpret the functional significance of the units and layers of a DCNN.
翻译:处理深革命神经网络(DCNN)的一个重要问题是“黑盒子问题”,它代表了内部信息代表性和处理的未知因素,特别是在中层和上层。在本研究中,我们采用了一种系统神经科学方法,以测量VGG16单元的视觉特征选择性和可视空间可容场。每个单元的方向和空间频率调试都是用正弦缩放模拟来测量的。每个单元的图像类别选择性也是用自然图像刺激度来衡量的。所有革命单元的可接收场的空间结构是通过激活加权平均(AWA)和振动加权共变(AWC)分析来估计的。在中间层(3区和4区),AWC分析成功地重建了预测该单元视觉选择性的可容场。这些结果表明,有可能利用分析重建的可容场结构来解释DCNN的单元和层次的功能意义。