Complex physical tasks entail a sequence of object interactions, each with its own preconditions, which can be difficult for robotic agents to learn efficiently solely through their own experience. We introduce an approach to discover activity-context priors from in-the-wild egocentric video captured with human-worn cameras. For a given object, an activity-context prior represents the set of other compatible objects that are required for activities to succeed (e.g., a knife and cutting board brought together with a tomato are conducive to cutting). We encode our video-based prior as an auxiliary reward function that encourages an agent to bring compatible objects together before attempting an interaction. In this way, our model translates everyday human experience into embodied agent skills. We demonstrate our idea using egocentric EPIC-Kitchens video of people performing unscripted kitchen activities to benefit virtual household robotic agents performing various complex tasks in AI2-iTHOR, significantly accelerating agent learning. Project page: http://vision.cs.utexas.edu/projects/ego-rewards/
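To make the reward-shaping idea concrete, below is a minimal sketch, not the authors' implementation. It assumes the activity-context prior has already been distilled from egocentric video into a compatibility matrix over object classes; the matrix here is random placeholder data, and all names, the averaging rule, and the shaping weight are illustrative assumptions.

```python
import numpy as np

# Hypothetical activity-context prior: prior[o, c] scores how often object
# class c co-occurs with object o just before successful interactions in
# egocentric video. Random placeholder values stand in for learned scores.
NUM_OBJECTS = 50
rng = np.random.default_rng(0)
prior = rng.random((NUM_OBJECTS, NUM_OBJECTS))
prior /= prior.sum(axis=1, keepdims=True)  # normalize per target object

def aux_reward(target_obj: int, nearby_objs: set[int]) -> float:
    """Auxiliary reward: how conducive the current configuration of nearby
    objects is to interacting with `target_obj`, under the video prior."""
    if not nearby_objs:
        return 0.0
    # Average compatibility of the objects the agent has brought together.
    return float(np.mean([prior[target_obj, c] for c in nearby_objs]))

def shaped_reward(env_reward: float, target_obj: int,
                  nearby_objs: set[int], weight: float = 0.1) -> float:
    """Environment reward plus the prior-based shaping bonus."""
    return env_reward + weight * aux_reward(target_obj, nearby_objs)

# Example: the bonus grows as the agent gathers compatible objects
# (object indices here are arbitrary).
print(shaped_reward(0.0, target_obj=3, nearby_objs={7, 12}))
```

The key design point this sketch illustrates is that the shaping term rewards assembling compatible objects before the interaction is attempted, rather than only rewarding task completion, which is what accelerates exploration.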