We propose Token Turing Machines (TTM), a sequential, autoregressive Transformer model with memory for real-world sequential visual understanding. Our model is inspired by the seminal Neural Turing Machine, and has an external memory consisting of a set of tokens which summarise the previous history (i.e., frames). This memory is efficiently addressed, read, and written using a Transformer as the processing unit/controller at each step. The model's memory module ensures that a new observation is processed only together with the contents of the memory (and not the entire history), meaning that it can efficiently process long sequences with a bounded computational cost at each step. We show that TTM outperforms alternatives such as Transformer models designed for long sequences and recurrent neural networks on two real-world sequential visual understanding tasks: online temporal activity detection from videos and vision-based robot action policy learning. Code is publicly available at: https://github.com/google-research/scenic/tree/main/scenic/projects/token_turing
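The read-process-write loop described above can be sketched as follows. This is a minimal NumPy toy, not the paper's implementation: the learned token-summarization module and the Transformer controller are replaced by stand-ins (random importance weights and an identity map), and all sizes are hypothetical. It only illustrates why the per-step cost stays bounded: each step touches a fixed number of memory and observation tokens, regardless of how many frames have been seen.

```python
import numpy as np

def summarize(tokens, k, rng):
    """Pool n tokens down to k summary tokens via importance weights.
    In TTM these weights come from a learned module; random logits
    stand in for it here."""
    n, _ = tokens.shape
    logits = rng.standard_normal((k, n))                     # hypothetical importance logits
    weights = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
    return weights @ tokens                                  # (k, d) summary tokens

def ttm_step(memory, observation, mem_size, read_size, rng):
    """One TTM step: read from memory + new observation, process, write back."""
    # Read: compress memory and observation tokens into a small working set.
    read = summarize(np.concatenate([memory, observation]), read_size, rng)
    # Process: stand-in for the Transformer controller (identity here).
    processed = read
    # Write: form the new memory from memory, observation, and processed tokens.
    new_memory = summarize(
        np.concatenate([memory, observation, processed]), mem_size, rng)
    return new_memory, processed

rng = np.random.default_rng(0)
d, mem_size, read_size, obs_tokens = 8, 16, 4, 32            # toy dimensions
memory = np.zeros((mem_size, d))
for _ in range(100):                                         # cost per step is constant
    obs = rng.standard_normal((obs_tokens, d))               # e.g. tokens of one frame
    memory, out = ttm_step(memory, obs, mem_size, read_size, rng)
print(memory.shape, out.shape)
```

Because the memory is a fixed set of `mem_size` tokens, the controller at every step attends over at most `mem_size + obs_tokens` inputs, never the full history.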