In this paper, we showed that adding within-layer recurrent connections to feed-forward neural network models could improve the performance of neural response prediction in early visual areas by up to 11 percent across different data sets and over tens of thousands of model configurations. To understand why recurrent models perform better, we propose that recurrent computation can be conceptualized as an ensemble of multiple feed-forward pathways of different lengths with shared parameters. By reformulating a recurrent model as a multi-path model and analyzing the recurrent model through its multi-path ensemble, we found that the recurrent model outperformed the corresponding feed-forward one due to the former's compact and implicit multi-path ensemble that allows approximating the complex function underlying recurrent biological circuits with efficiency. In addition, we found that the performance differences among recurrent models were highly correlated with the differences in their multi-path ensembles in terms of path lengths and path diversity; a balance of paths of different lengths in the ensemble was necessary for the model to achieve the best performance. Our studies shed light on the computational rationales and advantages of recurrent circuits for neural modeling and machine learning tasks in general.
翻译:在本文中,我们显示,在向导神经网络模型中增加层内重复连接,可以提高早期视觉地区神经反应预测的性能,在不同数据集和数万个模型配置中达到11%以上。为了理解为什么重复模型表现更好,我们提议,经常性计算可以概念化为不同长度和共享参数的多个向导路径的组合体。通过重塑一个作为多路模式的经常性模型,并通过多路共通物分析经常性模型,我们发现,由于前者的紧凑和隐含多路共通元素,使得经常生物电路的复杂功能能够以效率接近。此外,我们发现,经常性模型的性能差异与其在路径长度和路径多样性方面的多路方组合差异高度相关;通过多路共通模式中不同长度的路径平衡对于实现最佳性能是必要的。我们的研究揭示了模型的计算原理和经常电路的优势。