Discovery and learning of an underlying spatiotemporal hierarchy in sequential data is an important topic for machine learning. Despite this, little work has been done to explore hierarchical generative models that can flexibly adapt their layerwise representations in response to datasets with different temporal dynamics. Here, we present Variational Predictive Routing (VPR) - a neural probabilistic inference system that organizes latent representations of video features in a temporal hierarchy, based on their rates of change, thus modeling continuous data as a hierarchical renewal process. By employing an event detection mechanism that relies solely on the system's latent representations (without the need of a separate model), VPR is able to dynamically adjust its internal state following changes in the observed features, promoting an optimal organisation of representations across the levels of the model's latent hierarchy. Using several video datasets, we show that VPR is able to detect event boundaries, disentangle spatiotemporal features across its hierarchy, adapt to the dynamics of the data, and produce accurate time-agnostic rollouts of the future. Our approach integrates insights from neuroscience and introduces a framework with high potential for applications in model-based reinforcement learning, where flexible and informative state-space rollouts are of particular interest.
翻译:在连续数据中发现和学习一个深层时空等级是机器学习的一个重要专题。尽管如此,在探索等级基因化模型方面没有做多少工作,以根据不同时间动态的数据集灵活调整其分层表达方式以适应不同时间动态的数据集。在这里,我们介绍的是动态预测运行(VPR)-神经概率推断系统,根据时间等级的变化率在时间等级中组织视频特征的潜在表达方式,从而将连续数据建模为等级更新过程的模型。通过使用完全依赖系统潜在表现(不需要单独模型)的事件检测机制,VPR能够根据所观察到的特征的变化动态调整其内部状态,促进模型潜在等级之间最佳的表达方式。我们用几个视频数据集显示,VPR能够发现事件边界,在时间等级中分解随机波纹特征,适应数据的动态,并产生准确的时间-无序滚动。我们的方法整合了神经科学的洞察力,并引入了在模型中具有高度潜在兴趣的动态学习空间的强化框架。