Artificial neural networks that can recover latent dynamics from recorded neural activity may provide a powerful avenue for identifying and interpreting the dynamical motifs underlying biological computation. Given that neural variance alone does not uniquely determine a latent dynamical system, interpretable architectures should prioritize accurate and low-dimensional latent dynamics. In this work, we evaluated the performance of sequential autoencoders (SAEs) in recovering latent chaotic attractors from simulated neural datasets. We found that SAEs with widely-used recurrent neural network (RNN)-based dynamics were unable to infer accurate firing rates at the true latent state dimensionality, and that larger RNNs relied upon dynamical features not present in the data. On the other hand, SAEs with neural ordinary differential equation (NODE)-based dynamics inferred accurate rates at the true latent state dimensionality, while also recovering latent trajectories and fixed point structure. Ablations reveal that this is mainly because NODEs (1) allow use of higher-capacity multi-layer perceptrons (MLPs) to model the vector field and (2) predict the derivative rather than the next state. Decoupling the capacity of the dynamics model from its latent dimensionality enables NODEs to learn the requisite low-D dynamics where RNN cells fail. Additionally, the fact that the NODE predicts derivatives imposes a useful autoregressive prior on the latent states. The suboptimal interpretability of widely-used RNN-based dynamics may motivate substitution for alternative architectures, such as NODE, that enable learning of accurate dynamics in low-dimensional latent spaces.
翻译:人造神经网络可以从记录的神经活动中恢复潜在动力学,这可能为识别和解释生物计算中的动态模式提供了有效途径。鉴于单纯神经方差无法唯一确定潜在动力学系统,可解释性体系结构应优先考虑准确和低维的潜在动态。本研究中,我们评估了顺序自编码器(SAEs)在从模拟神经数据集中恢复潜在混沌吸引子方面的性能。我们发现,使用基于循环神经网络(RNN)的动态的SAEs无法在真实潜在状态维度处推断出准确的放电率,而更大的RNN则依赖于数据中没有的动态特征。另一方面,具有基于神经常微分方程(NODE)的动态的SAEs推测出真实潜在状态维度处的准确率,同时还恢复了潜在轨迹和固定点结构。消融试验表明,这主要是因为NODEs (1) 允许使用高容量多层感知器(MLPs)来建模矢量场,和 (2) 预测导数而不是下一个状态。将动态模型的容量从其潜在维度解耦使NODE能够学习到循环神经元失败的所需低维动态。此外,NODE预测导数的事实为潜在状态带来了有用的自回归先验。广泛使用的基于RNN的动态的次优可解释性可能会促进对替代架构(如NODE)的替换,这些替代架构使学习在低维潜在空间中的准确动态成为可能。