带有内嵌的主观时间尺度的变式预测分流 (Variational Predictive Routing with Nested Subjective Timescales)

Discovery and learning of an underlying spatiotemporal hierarchy in sequential data is an important topic for machine learning. Despite this, little work has been done to explore hierarchical generative models that can flexibly adapt their layerwise representations in response to datasets with different temporal dynamics. Here, we present Variational Predictive Routing (VPR) - a neural probabilistic inference system that organizes latent representations of video features in a temporal hierarchy, based on their rates of change, thus modeling continuous data as a hierarchical renewal process. By employing an event detection mechanism that relies solely on the system's latent representations (without the need of a separate model), VPR is able to dynamically adjust its internal state following changes in the observed features, promoting an optimal organisation of representations across the levels of the model's latent hierarchy. Using several video datasets, we show that VPR is able to detect event boundaries, disentangle spatiotemporal features across its hierarchy, adapt to the dynamics of the data, and produce accurate time-agnostic rollouts of the future. Our approach integrates insights from neuroscience and introduces a framework with high potential for applications in model-based reinforcement learning, where flexible and informative state-space rollouts are of particular interest.

翻译：在连续数据中发现和学习一个深层时空等级是机器学习的一个重要专题。尽管如此,在探索等级基因化模型方面没有做多少工作,以根据不同时间动态的数据集灵活调整其分层表达方式以适应不同时间动态的数据集。在这里,我们介绍的是动态预测运行(VPR)-神经概率推断系统,根据时间等级的变化率在时间等级中组织视频特征的潜在表达方式,从而将连续数据建模为等级更新过程的模型。通过使用完全依赖系统潜在表现(不需要单独模型)的事件检测机制,VPR能够根据所观察到的特征的变化动态调整其内部状态,促进模型潜在等级之间最佳的表达方式。我们用几个视频数据集显示,VPR能够发现事件边界,在时间等级中分解随机波纹特征,适应数据的动态,并产生准确的时间-无序滚动。我们的方法整合了神经科学的洞察力,并引入了在模型中具有高度潜在兴趣的动态学习空间的强化框架。

相关内容

声纹识别

关注 444

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日