Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

Seamless integration of robots into human environments requires robots to learn how to use existing human tools. Current approaches for learning tool manipulation skills mostly rely on expert demonstrations provided in the target robot environment, for example by manually guiding the robot manipulator or by teleoperation. In this work, we introduce an automated approach that replaces the expert demonstration with a YouTube video for learning a tool manipulation strategy. The main contributions are twofold. First, we design an alignment procedure that aligns the simulated environment with the real-world scene observed in the video. This is formulated as an optimization problem that finds a spatial alignment of the tool trajectory that maximizes the sparse goal reward given by the environment. Second, we describe an imitation learning approach that focuses on the trajectory of the tool rather than the motion of the human. To this end, we combine reinforcement learning with an optimization procedure to find a control policy and the placement of the robot based on the tool motion in the aligned environment. We demonstrate the proposed approach on spade, scythe and hammer tools in simulation, and show the effectiveness of the trained policy for the spade in a demonstration on a real Franka Emika Panda robot.
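To make the alignment step more concrete, here is a minimal toy sketch of searching for a spatial alignment that maximizes a sparse goal reward. It is not the authors' method: the 2D rigid transform, the random-search optimizer, and the names `transform`, `sparse_reward`, and `align` are all assumptions made for illustration, and the reward function is a stand-in for rolling out the trajectory in a simulator.

```python
import math
import random


def transform(trajectory, dx, dy, theta):
    """Rotate a 2D trajectory by theta about the origin, then translate by (dx, dy)."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + dx, s * x + c * y + dy) for x, y in trajectory]


def sparse_reward(trajectory, goal, radius=0.05):
    """Hypothetical stand-in for the environment: 1.0 if any tool position
    comes within `radius` of the goal, otherwise 0.0."""
    return 1.0 if any(math.dist(p, goal) <= radius for p in trajectory) else 0.0


def align(trajectory, goal, n_samples=2000, seed=0):
    """Random search over (dx, dy, theta); an assumed optimizer, chosen only
    because a sparse reward provides no gradient to follow."""
    rng = random.Random(seed)
    best_params = (0.0, 0.0, 0.0)
    best_reward = sparse_reward(trajectory, goal)
    for _ in range(n_samples):
        params = (
            rng.uniform(-1.0, 1.0),
            rng.uniform(-1.0, 1.0),
            rng.uniform(-math.pi, math.pi),
        )
        reward = sparse_reward(transform(trajectory, *params), goal)
        if reward > best_reward:
            best_params, best_reward = params, reward
    return best_params, best_reward


# Toy tool trajectory: a quarter arc of radius 0.3 sampled at 20 points.
arc = [
    (0.3 * math.cos(i * math.pi / 38), 0.3 * math.sin(i * math.pi / 38))
    for i in range(20)
]
goal = (0.5, 0.3)
params, reward = align(arc, goal)
print("alignment (dx, dy, theta):", params, "reward:", reward)
```

In a realistic setting the reward would come from executing the aligned tool trajectory in the simulated environment, and the search space would be a 3D pose (and possibly scale) rather than the planar transform assumed here.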

