A Generalist Agent - 专知 (Zhuanzhi) Papers

May 12, 2022

A Generalist Agent

Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens. In this report we describe the model and the data, and document the current capabilities of Gato.
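The abstract says a single network emits "text, joint torques, button presses, or other tokens" depending on context. One way to picture this is a shared token vocabulary split into ranges, one per output type, so that the id of the emitted token already says how to interpret it. The sketch below is a toy illustration under that assumption only. The vocabulary sizes, the uniform binning of torques, and the names `encode_torque` and `decode_token` are all hypothetical and are not taken from the report.

```python
from typing import Tuple, Union

# Hypothetical vocabulary layout, chosen for illustration only.
TEXT_VOCAB = 1000    # ids [0, 1000) stand for text subwords
TORQUE_BINS = 256    # ids [1000, 1256) stand for discretized joint torques
NUM_BUTTONS = 18     # ids [1256, 1274) stand for button presses

TORQUE_OFFSET = TEXT_VOCAB
BUTTON_OFFSET = TORQUE_OFFSET + TORQUE_BINS
VOCAB_SIZE = BUTTON_OFFSET + NUM_BUTTONS


def encode_torque(value: float, low: float = -1.0, high: float = 1.0) -> int:
    """Map a continuous torque in [low, high] to a token id by uniform binning."""
    clipped = min(max(value, low), high)
    # Round to the nearest of TORQUE_BINS evenly spaced levels.
    bin_index = int((clipped - low) / (high - low) * (TORQUE_BINS - 1) + 0.5)
    return TORQUE_OFFSET + bin_index


def decode_token(
    token_id: int, low: float = -1.0, high: float = 1.0
) -> Tuple[str, Union[int, float]]:
    """Interpret a token id as (modality, value) based on its vocabulary range."""
    if 0 <= token_id < TORQUE_OFFSET:
        return ("text", token_id)  # index into a text subword table
    if token_id < BUTTON_OFFSET:
        bin_index = token_id - TORQUE_OFFSET
        return ("torque", low + bin_index / (TORQUE_BINS - 1) * (high - low))
    if token_id < VOCAB_SIZE:
        return ("button", token_id - BUTTON_OFFSET)
    raise ValueError(f"token id {token_id} is outside the vocabulary")


if __name__ == "__main__":
    for token in (5, encode_torque(0.25), BUTTON_OFFSET + 3):
        print(token, decode_token(token))
```

With a layout like this, the policy's output layer stays a single softmax over `VOCAB_SIZE` ids, and the environment wrapper, not the network, decides how to execute each decoded value.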
