While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance and multiple modes, and human demonstrations typically do not come with reward labels. These properties limit the applicability of current methods in Offline RL and Behavioral Cloning to learn from large, pre-collected datasets. In this work, we present Behavior Transformer (BeT), a new technique to model unlabeled demonstration data with multiple modes. BeT retrofits standard transformer architectures with action discretization coupled with a multi-task action correction inspired by offset prediction in object detection. This allows us to leverage the multi-modal modeling ability of modern transformers to predict multi-modal continuous actions. We experimentally evaluate BeT on a variety of robotic manipulation and self-driving behavior datasets. We show that BeT significantly improves over prior state-of-the-art work on solving demonstrated tasks while capturing the major modes present in the pre-collected datasets. Finally, through an extensive ablation study, we analyze the importance of every crucial component in BeT. Videos of behavior generated by BeT are available at https://notmahi.github.io/bet
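To make the discretize-then-correct idea concrete, the sketch below is a simplified illustration (our own, not the authors' released code) of an action head that pairs a categorical distribution over k-means action bins with a continuous per-bin offset; the class and method names (DiscretizedActionHead, fit_bins, sample) are hypothetical, and PyTorch plus scikit-learn are assumed.

```python
# Minimal sketch of a BeT-style "discretize + offset-correct" action head.
# Assumes PyTorch and scikit-learn; names are illustrative, not the authors' API.
import torch
import torch.nn as nn
from sklearn.cluster import KMeans


class DiscretizedActionHead(nn.Module):
    """Predicts a distribution over k action bins plus a continuous offset per bin."""

    def __init__(self, feature_dim: int, action_dim: int, num_bins: int = 16):
        super().__init__()
        self.num_bins = num_bins
        self.action_dim = action_dim
        # Bin centers are filled in by fit_bins() using k-means over demo actions.
        self.register_buffer("bin_centers", torch.zeros(num_bins, action_dim))
        # A single linear layer emits bin logits and one offset vector per bin.
        self.proj = nn.Linear(feature_dim, num_bins * (1 + action_dim))

    @torch.no_grad()
    def fit_bins(self, demo_actions: torch.Tensor) -> None:
        # Cluster the continuous demonstration actions into k discrete bins.
        km = KMeans(n_clusters=self.num_bins, n_init=10).fit(demo_actions.cpu().numpy())
        self.bin_centers.copy_(torch.as_tensor(km.cluster_centers_, dtype=torch.float32))

    def forward(self, features: torch.Tensor):
        out = self.proj(features)
        logits = out[..., : self.num_bins]
        offsets = out[..., self.num_bins:].reshape(
            *features.shape[:-1], self.num_bins, self.action_dim
        )
        return logits, offsets

    def sample(self, features: torch.Tensor) -> torch.Tensor:
        # Sample a bin from the categorical distribution (captures multi-modality),
        # then add that bin's predicted offset to its k-means center.
        logits, offsets = self(features)
        bins = torch.distributions.Categorical(logits=logits).sample()
        centers = self.bin_centers[bins]
        chosen_offsets = torch.gather(
            offsets, -2, bins[..., None, None].expand(*bins.shape, 1, self.action_dim)
        ).squeeze(-2)
        return centers + chosen_offsets
```

In this simplified view, the transformer backbone would supply `features` for each timestep; training would combine a classification loss on the bin logits with a regression loss on the offset of the ground-truth bin, so that sampling a bin at test time recovers one of the demonstrated modes while the offset restores continuous precision.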